Data Science and Informatics
- Data Wrangling Translations. Common data wrangling methods (like filtering, sorting, and adding columns) in JavaScript, Python, SQL, R, and Excel. All examples use mock data.
- How to Debug Small Programming Scripts. "This methodology will not find every bug in every program, but it is highly effective for the sort of short programs that beginner programmers are assigned as homework. These techniques then scale up to finding bugs in non-trivial programs."
- OpenRefine. OpenRefine (formerly Google Refine) is a powerful tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web services and external data.
- Data Science: A First Introduction. "an open source textbook aimed at introducing undergraduate students to data science. [...] In this book, we define data science as the study and development of reproducible, auditable processes to obtain value (i.e., insight) from data." Uses R's tidyverse packages and Jupyter notebooks.
- Practical Computing for Biologists. Call Number: QH 324.2 .H33 2011; ISBN: 9780878933914; Publication Date: 2010. "Although many of the techniques are relevant to molecular bioinformatics, the motivation for the text is much broader, focusing on topics and techniques that are applicable to a range of scientific endeavors."
- Exploratory Desktop. Exploratory Desktop provides an advanced, interactive, and reproducible data wrangling and analysis experience powered by R and visualization.
- R for data science : import, tidy, transform, visualize, and model data. ISBN: 9781491910368; Publication Date: 2017. Clearly written guide to using R for data science, visualization, tidying data, and more, by the author of the "tidyverse" package. Highly recommended!
- Python Data Science Handbook. Call Number: QA76.73.P98 V365 2016 (Youngblood Energy Library); ISBN: 9781491912058; Publication Date: 2016. "For many researchers, Python is a first-class tool mainly because of its libraries for storing, manipulating, and gaining insight from data. Several resources exist for individual pieces of this data science stack, but only with the Python Data Science Handbook do you get them all--IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and other related tools. Working scientists and data crunchers familiar with reading and writing Python code will find this comprehensive desk reference ideal for tackling day-to-day issues: manipulating, transforming, and cleaning data; visualizing different types of data; and using data to build statistical or machine learning models."
- Python Data Analytics. ISBN: 1484209583; Publication Date: 2015. "Python Data Analytics will help you tackle the world of data acquisition and analysis using the power of the Python language. At the heart of this book lies the coverage of pandas, an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language."
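For readers who want a quick feel for the wrangling operations named in the first resource of this list (filtering, sorting, and adding columns), here is a minimal sketch in plain Python. It is not taken from any of the resources listed; the mock records, the `letter_grade` helper, and the grade cutoffs are all hypothetical and exist only for illustration.

```python
# Mock data, invented for illustration only.
rows = [
    {"name": "Ada", "dept": "Biology", "score": 88},
    {"name": "Ben", "dept": "Geology", "score": 72},
    {"name": "Cy", "dept": "Biology", "score": 95},
    {"name": "Di", "dept": "Chemistry", "score": 64},
]

# Filter: keep only rows with a score of at least 70.
passing = [row for row in rows if row["score"] >= 70]

# Sort: order the remaining rows from highest to lowest score.
ranked = sorted(passing, key=lambda row: row["score"], reverse=True)


def letter_grade(score):
    """Hypothetical helper: map a numeric score to a letter grade."""
    if score >= 90:
        return "A"
    if score >= 80:
        return "B"
    if score >= 70:
        return "C"
    return "D"


# Add a column: build new rows with a derived "grade" field,
# leaving the original dictionaries unchanged.
with_grade = [{**row, "grade": letter_grade(row["score"])} for row in ranked]

for row in with_grade:
    print(row["name"], row["dept"], row["score"], row["grade"])
```

The same three steps are usually written with a data frame library such as pandas or the tidyverse once a dataset grows beyond a few rows; the books and tutorials in this guide cover those tools.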
R Basics
- A Beginner's Guide to R. ISBN: 1282292161; Publication Date: 2009. Based on their extensive experience with teaching R & statistics to applied scientists, the authors provide a beginner's guide to R. To avoid the difficulty of teaching R & statistics at the same time, statistical methods are kept to a minimum.
- An Introduction to R. Introduction from the R core team.
- swirl. Swirl teaches you R programming and data science interactively, at your own pace, and right in the R console.
- Data Carpentry R Lesson. Follow the tutorials taught in Data Carpentry to learn R.
- Mastering R Shiny. "Shiny is a framework for creating web applications using R code. It is designed primarily with data scientists in mind, and to that end, you can create pretty complicated Shiny apps with no knowledge of HTML, CSS, or JavaScript."
- Handling Errors & Warnings in R | List of Typical Messages & How to Solve. "a list of typical errors and warnings. When clicking on the bullet points of the list, you are headed to detailed instructions on how to deal with the corresponding error or warning message."
- R Graph Gallery. "Hundreds of charts are displayed in several sections, always with their reproducible code available. The gallery makes a focus on the tidyverse and ggplot2."
Python Basics
- Software Carpentry Python Lesson. "This lesson is an introduction to programming in Python for people with little or no previous programming experience. It uses plotting as its motivating example [...] This lesson references JupyterLab, but can be taught using a regular Python interpreter as well. Please note that this lesson uses Python 3 rather than Python 2."
- Learn Python the Hard Way. Exercise-based guide to coding in Python.
- Full Stack Python. "Full Stack Python is an open book that explains concepts in plain language."
- Basic Python in Codewars. "Achieve mastery through challenge. Improve your skills by training with others on real code challenges." Suitable for those with some Python experience.
- The Python Package Index. "The Python Package Index (PyPI) is a repository of software for the Python programming language." The documentation for packages is often quite thorough and community forums can be very helpful.
- Python Graph Gallery. "The Python Graph Gallery is a collection of hundreds of charts made with Python. Each example is accompanied by its corresponding reproducible code along with comprehensive explanations. The gallery offers tutorials that cater to beginners to help kickstart their journey, as well as advanced examples that demonstrate the potency of Python in the realm of data visualization."
Excel overview
- How to use Excel: A beginner's guide to Microsoft's spreadsheet program. "Here you'll find how to get started with Excel — everything from performing simple calculations to making pivot tables to specifying custom print areas and customizing your view."
- Using Microsoft 365 with Windows 8: Excel. This is chapter 13, Excel, of Using Microsoft 365 with Windows 8. It provides an illustrated walkthrough of basic Excel features.
- Quick Start for Microsoft Excel. Microsoft documentation for Excel. Click "Next: Intro to Excel" to continue the tour of Excel features, with step-by-step illustrated instructions for Excel 365 on both Windows and Mac.
- Pro Office for iPad: how to be productive with Office for iPad. Call Number: eBook; ISBN: 9781430245889; Publication Date: 2014. Pages 151-245 cover using Microsoft Excel on iPads.
- Excel tips. "You can master Excel by reading these tips and tricks on how to add a drop-down list in an Excel cell to finding duplicates, how to delete blank rows in Excel, and more" A list of 83 links to tutorials on common Excel tasks.
- Excel Essential Training (Office 365/Microsoft 365). LinkedIn Learning video. Use your OU email and password to sign in. Search LinkedIn Learning for similar videos for Excel 2019 and 2016. More information from OU IT on LinkedIn at https://itsupport.ou.edu/TDClient/35/Norman/KB/ArticleDet?ID=13&SIDs=110
- Master Microsoft Excel. LinkedIn Learning learning path (a series of videos on a specific topic). Use your OU email and password to sign in. More information from OU IT on LinkedIn at https://itsupport.ou.edu/TDClient/35/Norman/KB/ArticleDet?ID=13&SIDs=110
- Excel for Chemists. Call Number: QD 39.3 .S67 B55 2011 (also online); ISBN: 9780471220589; Publication Date: 2002. Chemists across all subdisciplines use Excel to record data in tabular form, but few have learned to take full advantage of the scientific calculating power within this program. Excel is capable of helping chemists process, analyze, and present scientific data, from the relatively simple to the highly complex.
MATLAB data analysis
- MATLAB Download from OU IT. "MATLAB is available to faculty, staff, and students on the Windows, Mac OS X and Linux operating systems. MATLAB is licensed for teaching, learning and academic research and may be installed on university owned and personal machines. This software is also available to current Earth & Energy students through the Virtual lab."
- Programming with MATLAB. "The best way to learn how to program is to do something useful, so this introduction to MATLAB is built around a common scientific task: data analysis. Our real goal isn’t to teach you MATLAB, but to teach you the basic concepts that all programming depends on. [...] GNU Octave is a free and open-source alternative to MATLAB which shares its syntax (see more about compatibility). Thus, if you don’t have access to MATLAB, you can easily set up Octave on your computer and still work through the lesson." OU IT provides a campus MATLAB license.
- MATLAB Onramp. "Get started quickly with the basics of MATLAB." A two-hour online tutorial that can be accessed using OU's site license for MATLAB.
- Matlab. Call Number: eBook; ISBN: 9780128163450; Publication Date: 2018. "guides the reader through both programming and built-in functions to easily exploit MATLAB's extensive capabilities for tackling engineering and scientific problems. Assuming no knowledge of programming, this book starts with programming concepts, such as variables, assignments, and selection statements, moves on to loops, and then solves problems using both the programming concept and the power of MATLAB."
- Matlab Primer. Call Number: QA 297 .S4787 2005 (also available as eBook); ISBN: 1584885238; Publication Date: 2004. "Whether you are new to MATLAB, new to version 7.0, or simply in need of a hands-on, to-the-point reference, MATLAB® Primer provides the tools you need in a conveniently sized [...] pocketbook."
- Collecting, processing and presenting geoscientific information : MATLAB® and design recipes for Earth sciences. Call Number: eBook; ISBN: 9783662562031; Publication Date: 2018. "The overall aim of the book is to introduce students to the typical course followed by a data analysis project in earth sciences. A project usually involves searching relevant literature, reviewing and ranking published books and journal articles, extracting relevant information from the literature in the form of text, data, or graphs, searching and processing the relevant original data using MATLAB, and compiling and presenting the results as posters, abstracts, oral presentations and multimedia publications using graphics design software."
SAS data analysis
- Power SAS: a survival guide. Call Number: eBook; ISBN: 9781430211556; Publication Date: 2002. The nine chapters cover SAS basics, data access, data step programming, data manipulation, data management, data presentation, efficiency and performance, configuration and support, and SAS 9.
- A Step-By-Step Approach to Using SAS for Univariate and Multivariate Statistics. Call Number: QA 276.4 .H37 2005; ISBN: 1590474171; Publication Date: 2005. This book is an easy-to-understand introduction to SAS as well as to univariate and multivariate statistics. Clear explanations and simple language guide you through the research terminology, data input, data manipulation, and types of statistical analysis that are most commonly used in the social and behavioral sciences.
- SAS Video trainings on LinkedIn Learning. Use your OU email to log in to LinkedIn Learning to watch videos about using SAS.
- SAS/STAT 9.3 User's Guide. Provides detailed reference material for using SAS/STAT software to perform statistical analyses, including analysis of variance, regression, categorical data analysis, multivariate analysis, survival analysis, psychometric analysis, cluster analysis, nonparametric analysis, mixed-models analysis, and survey data analysis, with numerous examples in addition to syntax and usage information. This document also provides information about the Power and Sample Size Application and extensive information on using ODS Statistical Graphics.
- Documentation for All SAS Products. Choose the software you are using to go to its documentation and user guides.
Qualitative data analysis
- NVivo. "offers an intuitive qualitative data analysis experience that helps you uncover deeper research insights." Available with license through OU IT.
- Qualitative Data Analysis with NVivo. Call Number: H61.3 .B396 2019; ISBN: 1526449935; Publication Date: 2019. Packed with real-world examples and case studies, the book supports students through every stage of qualitative data analysis. Contains fully integrated instructions for using NVivo on both Mac and PC, with screenshots and click-by-click guidance.
- NVivo Software Tutorials & Support. The NVivo software is offered for free to current faculty, staff and students at the University of Oklahoma. This list, which was compiled in Feb. 2023, provides a quick reference guide to support users in getting started with NVivo.
- Taguette. "a free and open-source tool for qualitative research. You can import your research materials, highlight and tag quotes, and export the results!"
GIS Software
- ArcGIS Pro. Powerful proprietary desktop software for GIS.
- ArcGIS OU Licensing. Click the blue "Access Software" button on the page. The OU Libraries link on the page is no longer functional as OU has moved to a single-sign-on licensing format for ArcGIS.
Migration from Legacy ArcMap to ArcGIS Pro
- Webinar on migration (includes demos and links to more resources)
- Individual video guides for common GIS actions
- Perspective on GIS software evolution from ESRI
- Switching to ArcGIS Pro from ArcMap book
- Project R. Free and open source programming environment. Requires installation of additional packages to enable most GIS applications.
- Python. Information and downloads for the Python programming language.
- Python Anaconda. Open source Python 3.7 platform for data science.
- Quantum GIS (QGIS). Free and open-source GIS software that includes both a graphical user interface and options for accessing command line resources.
GIS Tutorials and Help
- ArcGIS documentation. Step-by-step instructions and documentation for ArcGIS software including ArcMap.
- ArcGIS training materials by ESRI. Tutorials for ArcGIS produced by the software company. Request an OU-linked ESRI account using the OU licensing link above.
- Python in ArcGIS. Free ESRI video tutorial on scripting with Python for ArcGIS.
- QGIS tutorials. Links compiled by QGIS maintainers for QGIS training materials and tutorials in multiple languages.
- Learning geospatial analysis with python : understand GIS fundamentals and perform remote sensing data analysis using python 3.7. Call Number: Online access; ISBN: 1789957931; Publication Date: 2019. "This book will take you through GIS techniques, geodatabases, geospatial raster data, and much more using the latest built-in tools and libraries in Python 3.7." Recommended by a grad student who found that chapters 2 and 3 explained a lot of concepts that get glossed over in courses.
- R-Spatial. r-spatial.org is a website and blog for those interested in using R to analyse spatial or spatio-temporal data.
- Geocomputation in R. "This book is about using the power of computers to do things with geographic data. It teaches a range of spatial skills, including: reading, writing and manipulating geographic data; making static and interactive maps; applying geocomputation to solve real-world problems; and modeling geographic phenomena."
- GIS Help at OU. This page features projects on which OU Libraries has assisted with GIS work and contains a link to the Digital Scholarship Lab for consultations.
- Digital Scholarship. Contact a Digital Scholarship specialist for GIS assistance.
- Stack Exchange for GIS. A searchable question and answer website for GIS users. If you can't find your answer, you can create a reproducible example and ask your own question.
Bioinformatics Software
- ELIXIR Bioinformatics Tools and Data Services Registry. Well-curated directory with robust searching and filtering capabilities.
- FDA: Bioinformatics Tools. List of tools for working with datasets.
- Galaxy. Galaxy is an open source, web-based platform for data intensive biomedical research. If you are new to Galaxy start here or consult our help resources. You can install your own Galaxy by following the tutorial and choose from thousands of tools from the Tool Shed.
- Bioconductor. Bioconductor provides tools (in R) for the analysis and comprehension of high-throughput genomic data.
- Qiita. "Qiita (canonically pronounced cheetah) is an entirely open-source microbial study management platform. It allows users to keep track of multiple studies with multiple ‘omics data. Additionally, Qiita is capable of supporting multiple analytical pipelines through a 3rd-party plugin system, allowing the user to have a single entry point for all their analyses."
Visualizations with Inkscape (open source) and Illustrator (Adobe, OU license)
- Adobe Illustrator Basics. "Illustrator is ideal when working with logos, complex illustrations, or anything that will appear in a variety of printed formats because the vector graphics will not lose quality when their sizes are altered."
- Inkscape for STEM. "intended to help researchers learn how to use the free vector-based graphical software Inkscape."
- Digital illustration fundamentals. Call Number: eBook; ISBN: 1484216970; Publication Date: 2015. This concise book covers concepts central to digital illustration using the Inkscape 0.91 open source software package as well as the Adobe Illustrator professional Illustration package. The book builds on the foundational concepts of vector graphics and the SVG format, and gets more advanced as chapters progress, covering what vector new media formats are best for use with Android Studio, Java and JavaFX, iOS, Kindle and HTML5, as well as key factors regarding the data footprint optimization work process, and why data footprint optimization is important.
- Adobe Illustrator Classroom in a Book (2021 Release). Call Number: eBook; ISBN: 9780136805533; Publication Date: 2020. "Adobe Illustrator Classroom in a Book (2021 release) contains lessons that cover the basics and beyond, providing countless tips and techniques to help you become more productive with the program. You can follow the book from start to finish or choose only those lessons that interest you."
- Vector Basic Training. Call Number: Youngblood Energy Library (Geology) Stacks T 385 .G588 2016; ISBN: 9780134176734; Publication Date: 2015. "learn how and when to create art point-by-point through rough building and refinement as well as through shape-building techniques using Illustrator's Pathfinder palette."
- Illustrator CS6. Call Number: Youngblood Energy Library (Geology) Stacks T 385 .W454123 2013 (also eBook); ISBN: 9780321822178; Publication Date: 2012. "Readers can follow the steps in a tutorial fashion, or use the book as a reference guide to individual program features. Many of the key illustrations used in the book are available via Web download for practice by readers."