Big Data Training – NTI

This repository documents my hands-on journey through the Big Data Summer Training Program organized by NTI in collaboration with ITIDA.
The program dives deep into Big Data tools, platforms, and real-world applications.

What You’ll Find Here

This repo contains:

Jupyter Notebooks & Labs from each topic
Technical notes & key takeaways
Practice examples and use-case simulations
Commands and setups used in the virtual environment

Topics Covered

Big Data Era & Kunpeng Architecture
HDFS + ZooKeeper – Distributed storage and cluster coordination
HBase + Hive – NoSQL + distributed data warehouse (SQL-like)
ClickHouse – OLAP database for real-time analytics
MapReduce + YARN – Distributed processing engine and resource manager
Spark + Flink – Batch + Stream processing with in-memory computing
Flume + Kafka – Data ingestion and real-time messaging pipelines
Elasticsearch – Distributed search engine and analytics
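
As a small illustration of the MapReduce model listed above, here is a minimal pure-Python sketch of a word count. It is not taken from the labs in this repo: the function names (map_phase, shuffle_phase, reduce_phase, word_count) are hypothetical, and everything runs in a single process rather than on a Hadoop/YARN cluster. It only shows the map, shuffle, and reduce steps in miniature.

```python
from collections import defaultdict


def map_phase(lines):
    """Emit a (word, 1) pair for every word in every input line."""
    for line in lines:
        for word in line.lower().split():
            yield word, 1


def shuffle_phase(pairs):
    """Group emitted values by key, mimicking the step between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Sum the grouped counts for each word."""
    return {word: sum(counts) for word, counts in grouped.items()}


def word_count(lines):
    """Run the three phases in order on an iterable of text lines."""
    return reduce_phase(shuffle_phase(map_phase(lines)))


if __name__ == "__main__":
    print(word_count(["big data", "Big Data tools"]))
```

On a real cluster the map and reduce functions would run in parallel across nodes and the framework would handle the shuffle; here the grouping is done in memory for readability.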

Tools & Technologies

Tool/Tech	Use Case
Linux, SQL, Python	Foundations for scripting & querying
HDFS	Distributed data storage
Hive	SQL-style querying
HBase	NoSQL for large-scale datasets
Kafka	Real-time messaging system
Spark & Flink	Data processing engines
ClickHouse	High-performance analytics
Flume, Sqoop	Data ingestion from logs & DBs
Elasticsearch	Search and analytics
ZooKeeper	Cluster coordination

Goal of this Repo

This repo serves as:

A personal reference and knowledge base
A full recap of my learning journey
A practical showcase for Big Data skills

Let's Connect

Feel free to explore the notebooks or reach out if you'd like to collaborate or discuss Big Data topics!

Reach out to me on LinkedIn

keroloshany47/NTI_Big_Data
