Skip to content

Coleção de links, snippets, notebooks e outras coisas relacionadas ao aprendizado e conhecimento em Data Science

Notifications You must be signed in to change notification settings

sandysnunes/data-science-notes

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

47 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

My notes on Data Science

Table of Contents

  1. Frameworks, Tools and libraries
  2. Visualisation
  3. Snippets
  4. Books
  5. Papers
  6. Datasets
  7. Infographic
  8. Cheat Sheets
  9. Interview Questions
  10. Notebooks
  11. GitHub Repos
  12. Podcasts
  13. Communities

Frameworks, Tools and libraries

Visualisation

Snippets

Jupyter - Pyspark
docker run --rm \
-v ~:/home/jovyan/work \
-p 8888:8888 \
-p 4040:4040 \
-p 4041:4041 \
jupyter/pyspark-notebook
import pyspark 
from pyspark.sql import SparkSession

#spark context
sc = pyspark.SparkContext('local[*]')

#spark session
spark = SparkSession.builder.appName('App name').getOrCreate()

# do something to prove it works
rdd = sc.parallelize(range(1000))
rdd.takeSample(False, 5)
D3 on Jupyter
!pip install py_d3 -q
%load_ext py_d3
%%d3


<div id="my_dataviz"></div>


<script>
    //your code here
</script>

Books

Papers

Datasets

Infographic

Preview Description
A visual guide to Becoming a Data Scientist in 8 Steps by DataCamp (img)
Mindmap on required skills (img)
Swami Chandrasekaran made a Curriculum via Metro map.
by @kzawadz via twitter
By Data Science Central
From this article by Berkeley Science Review.
Data Science Wars: R vs Python
How to select statistical or machine learning techniques
Choosing the Right Estimator
The Data Science Industry: Who Does What
Data Science Venn Diagram
Different Data Science Skills and Roles from this article by Springboard
Data Fallacies To Avoid A simple and friendly way of teaching your non-data scientist/non-statistician colleagues how to avoid mistakes with data. From Geckoboard's Data Literacy Lessons.

Cheat Sheets

Notebooks

GitHub Repos

Podcasts

Communities

Interview Questions

About

Coleção de links, snippets, notebooks e outras coisas relacionadas ao aprendizado e conhecimento em Data Science

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published