Stat 351 - Statistical Computing II: Data Management and Visualization

Textbook: Statistical Computing in R and Python

Backwards Design

At the end of this course, students should know how to:

Access and leverage data stored in formats which are commonly used outside of statistics (HTML, JSON, XML, PDF, APIs) and transform these data to formats which are used for statistical analysis.
- Scrape data off of the internet and assemble it into a "tidy" format for visualization and analysis.
- Read in structured data from record-based formats (XML, JSON) and transform this data to a table-based format.
- Use optical character recognition and other tools to extract data from a PDF file systematically.
- Use an API to request data from an online service.
- Implement data cleaning and quality control measures to ensure that data is read in correctly.
Develop skills for visualization and communication of complex data using interactive graphics. You will be able to
- Determine when an interactive chart is preferable to a static chart.
- Create an interactive chart using JavaScript-based tools such as Plotly, Observable.js, or Shiny.
- Integrate your interactive chart into a report or web page, along with supportive text describing the chart and important findings.
Understand and leverage data management tools for storing and manipulating data, including
- Identifying situations where an external database is preferable to working with data in-memory.
- Accessing data in an external SQL, Parquet, or Arrow database.
- Discussing the trade offs between different tools for data management and different approaches to data storage.
- Design an analysis strategy for large data which does not fit into computer memory by selecting from strategies such as sampling and split-apply-combine.

Timeline

See schedule.xlsx

Course site information

Configured in _quarto.yml
Week by week files built automatically (code in code/gen-week-files-from-course-schedule.R, data in course-schedule.xlsx)
Syllabus uses course-schedule.xlsx for topics, with due dates and semester dates specified in sheets in the spreadsheet.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
_extensions		_extensions
_freeze		_freeze
assignment-repos		assignment-repos
code		code
design		design
docs		docs
images		images
partials		partials
slides		slides
weeks		weeks
.gitignore		.gitignore
.gitmodules		.gitmodules
404.qmd		404.qmd
LICENSE		LICENSE
README.md		README.md
_quarto.yml		_quarto.yml
_variables-darkly.scss		_variables-darkly.scss
_variables-flatly.scss		_variables-flatly.scss
course-links.qmd		course-links.qmd
course-overview.qmd		course-overview.qmd
course-support.qmd		course-support.qmd
index.qmd		index.qmd
schedule.R		schedule.R
schedule.yaml		schedule.yaml
syllabus.qmd		syllabus.qmd
theme-dark.scss		theme-dark.scss
theme.scss		theme.scss
unl-stat351.Rproj		unl-stat351.Rproj

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Stat 351 - Statistical Computing II: Data Management and Visualization

Backwards Design

Timeline

Course site information

About

Uh oh!

Releases

Packages

Languages

License

unl-statistics/stat351

Folders and files

Latest commit

History

Repository files navigation

Stat 351 - Statistical Computing II: Data Management and Visualization

Backwards Design

Timeline

Course site information

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages