In celebration of Pi Day*, we are highlighting the data sciences workstation at the NIH Library. The workstation provides tools for data analysis, processing, and visualization, as well as links to information on using data tools, conducting research using data mining, and finding data available for reuse. The workstation can be reserved for up to 4 hours per day from the data services page.
The following programs are available on the data sciences workstation in the NIH Library:
- Anaconda: A distribution of Python, a high-level, general-purpose, dynamic programming language. Anaconda bundles 195 of the most popular Python packages for science, math, and data analysis.
- JMP: Statistical software that allows you to perform data analysis, data acquisition and cleanup, data visualization, reliability analysis, text exploration, and statistical modeling. JMP integrates with SAS, MATLAB, and R.
- MATLAB: A programming language that allows you to analyze data, develop algorithms, and create models and applications. The language, tools, and built-in math functions enable you to explore multiple approaches and reach a solution faster than with spreadsheets or traditional programming languages.
- OpenRefine: A tool for working with datasets that need to be transformed, cleaned, or processed before use. The OpenRefine facet and filter functions facilitate initial exploration of large datasets.
- R and RStudio: The R environment provides an integrated suite of software facilities for data manipulation, calculation, and graphical display. RStudio is an open-source user interface for R that provides access to some packages not available in the basic R environment.
- Stata: A complete, integrated statistical software package that provides everything you need for data analysis, data management, and graphics. A point-and-click interface and a powerful, intuitive command syntax make Stata fast, accurate, and easy to use.
- SPSS: An advanced statistical software analysis program that enables you to quickly dig deeper into your data, making it a more effective tool than spreadsheets, databases, or standard multi-dimensional tools for analysts. SPSS Statistics includes functionalities such as descriptive statistics, data preparation, graphing, bivariate statistics procedures, factor and cluster analysis, and linear and ordinal regression.
- Statistica: An advanced analytics software package that provides a comprehensive array of data analysis, data management, data visualization, and data mining procedures. It includes a wide selection of predictive modeling, clustering, classification, and exploratory techniques in one platform.
Additional information about software available on the data sciences workstation can be found here.
*Pi Day is traditionally celebrated on March 14th (3.14) around the world. The Greek letter pi (π) is the symbol used in mathematics to represent a constant, the ratio of the circumference of a circle to its diameter, which is approximately 3.14159. Pi has been calculated to over one trillion digits beyond its decimal point. Because it is an irrational and transcendental number, its decimal representation continues infinitely without ever settling into a repeating pattern. NIH traditionally celebrates Pi Day by exploring the role that the quantitative sciences play in biomedical science.
The NIH Library in Building 10 serves the information needs of NIH staff and select Department of Health and Human Services (HHS) agencies. The NIH Library is part of the Office of Research Services (ORS) in the Office of the Director (OD). To learn about NIH Library classes, events, resources, and services, subscribe to our e-news.