Skip to content

Infostrux-Solutions/hol-snowflake-ds-ml

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Hands-On Lab

Welcome to the Hands-On Lab for Data Science in Snowflake! This repository contains all the necessary files and instructions to set up and run the lab environment using Snowflake, Streamlit, and various Python packages.

Table of Contents

Prerequisites

Before you begin, ensure you have the following:

  • A Snowflake account with appropriate permissions.
  • Python 3.11.9 installed on your machine.
  • pip package manager.
  • Access to the GitHub repository: hol-notebook.

Setup Instructions

  1. Clone the Repository

    Clone the repository to your local machine:

    git clone https://github.com/Infostrux-Solutions/hol-snowflake-ds-ml.git
    cd hol-notebook
  2. Set Up Snowflake Environment (via snowsql, not sure this works)

    Run the setup.sql script to set up the Snowflake environment:

    snowsql -f setup.sql

    This script will:

    • Create a new organization account.
    • Create a new database and warehouses.
    • Set up roles and permissions.
    • Create API and external access integrations.
  3. Set Up Snowflake Environment (manually)

    1. modify setup.sql to specify an inital password, edit the list of named users and set the number of nameless users to create and then run setup.sql
    2. modify generate_qualification_dataset.sql to set the row_count and then run to manufacture the qualification dataset
    3. create a table HOL_LAB_DATASCIENCE.LAB.E_COMMERCE_REVIEWS from the data_reviews_data.json file and make sure you do this using the ROLE_HOLE_LAB_DATASCIENCE role

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •