Automated Classroom Attendance System Using Deep Learning and Transformers

Overview

This project focuses on developing an Automated Classroom Attendance System leveraging Deep Learning and Transformer models to accurately count students in classrooms. By addressing challenges like occlusions, overlapping individuals, and varied lighting conditions, the system ensures high accuracy and robust performance in real-world scenarios.

Visualization

Interface

Abstract

The system automates the process of taking attendance by using advanced Convolutional Neural Networks (CNNs) and Transformers to detect and count students in classroom images or videos. A custom dataset was created for training and evaluation, ensuring adaptability to diverse scenarios. The solution integrates seamlessly with a user-friendly interface for real-time processing, enhancing classroom management efficiency.

Objectives

Automate classroom attendance using deep learning techniques.
Develop a robust CNN-based model to predict student counts accurately.
Address challenges such as occlusions, overlapping, and varied densities.
Design a model that generalizes across diverse classroom setups.
Enable real-time deployment with edge devices for dynamic monitoring.
Create an intuitive interface for easy input and visualization of results.

Problem Statement

Accurately counting individuals in crowded environments remains challenging due to:

Occlusions
Overlapping individuals
Varied densities
Environmental factors like lighting and shadows

Traditional approaches fail to generalize across scenarios, resulting in inaccurate predictions. This project aims to tackle these limitations by developing a scalable, efficient solution using deep learning techniques.

Techniques and Tools Used

Development Frameworks & Libraries

Deep Learning:
- PyTorch (Primary framework for ResNet18 implementation)
- TensorFlow & Keras (Secondary framework for VGG16 implementation)
Data Processing:
- NumPy
- Pandas
Visualization:
- Matplotlib
- Seaborn

Web Development Stack (MERN)

Frontend: React.js
Backend: Python Flask (RESTful API)
Version Control: GitHub

Model Architectures

ResNet18:
- Handles images with overlapping individuals using residual blocks.
VGG16:
- Optimized for non-overlapping images with dense layers and high feature extraction capabilities.
Planned: Transformer-based models for advanced use cases.

Dataset Details

Place the Dataset URL here

Total Images: 598
Source: Custom dataset from classroom scenarios via CC cameras
Key Features:
- Multiple camera angles
- Varied lighting conditions
- Mixed occupancy levels (overlapped and non-overlapped scenarios)
Augmentation Techniques:
- Random rotation
- Brightness variation
- Horizontal flipping
- Random cropping
Dataset Online: Kaggle

Implementation Highlights

Data Preprocessing:
- Augmentation for dataset diversity
- Normalization and resizing of images
Model Training:
- Loss Function: Mean Squared Error (MSE)
- Optimizer: Adam optimizer
- Configuration: 50 epochs, batch size of 16
- Models are saved in HuggingFace Hub
Evaluation Metrics:
- MAE (Mean Absolute Error)
- MSE (Mean Squared Error)
- Density map visualization
Deployment:
- Flask API for real-time inference
- Edge device integration for on-the-spot monitoring
User Interface:
- Options to upload images/videos or enter URLs
- Real-time detection results with bounding boxes and counts Include the deployed models URL

Results

Accuracy: 85-95% across diverse scenarios
Processing Time: < 3 seconds per image
Scalability: Supports up to 100 simultaneous users

Future Scope

Video Analysis Integration:
- Extend the system to analyze video sequences for real-time monitoring.
Dataset Expansion:
- Include diverse classroom setups, lighting conditions, and crowd densities.
Enhanced Attention Mechanisms:
- Implement advanced attention layers for improved accuracy.
Cross-Domain Applications:
- Adapt the model for applications like wildlife counting, traffic monitoring, and disaster management.
Performance Optimization:
- Reduce computational overhead for faster processing and scalability.

How to Run the Project

Clone the repository: git clone <repository-link>
Install dependencies: pip install -r requirements.txt
Run the Flask server: python app.py
Open the frontend: Navigate to the React.js project directory and run npm start

License

MIT License

Installation

Clone the repository:

git clone https://github.com/harinivas-28/count-app.git

Navigate to the project directory:
```
cd count-app
```

Install dependencies:

npm install

cd backend
pip install requirements.txt

Usage

To start the frontend application, run:

npm run dev

To run the backend

NOTE: Use virtual environment for python

cd backend
python app.py

Contributing

Contributions are welcome! Please fork the repository and create a pull request with your changes.

Contact

For any questions or feedback, please contact harinivasg28704@gmail.com or harinivas.ganjarla@gmail.com

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
backend		backend
docs		docs
public		public
src		src
.gitignore		.gitignore
README.md		README.md
SDD.md		SDD.md
SRS.md		SRS.md
eslint.config.js		eslint.config.js
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
vite.config.js		vite.config.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Automated Classroom Attendance System Using Deep Learning and Transformers

Overview

Visualization

Interface

Abstract

Objectives

Problem Statement

Techniques and Tools Used

Development Frameworks & Libraries

Web Development Stack (MERN)

Model Architectures

Dataset Details

Implementation Highlights

Results

Future Scope

How to Run the Project

License

Installation

Usage

Contributing

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Languages

harinivas-28/Object-Detection-In-Images

Folders and files

Latest commit

History

Repository files navigation

Automated Classroom Attendance System Using Deep Learning and Transformers

Overview

Visualization

Interface

Abstract

Objectives

Problem Statement

Techniques and Tools Used

Development Frameworks & Libraries

Web Development Stack (MERN)

Model Architectures

Dataset Details

Implementation Highlights

Results

Future Scope

How to Run the Project

License

Installation

Usage

Contributing

Contact

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages