Date: May 2025
Institution: Indian Institute of Technology Jodhpur
This project presents a deep-learning–based pipeline for robust lateral control of autonomous vehicles. We compare Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) for end-to-end steering and throttle prediction, integrating object and lane detection modules for enhanced safety. Synthetic datasets are generated in Udacity and AirSimNH simulators, balanced and augmented to train multi-task models. A web-based Streamlit app demonstrates real-time inference.
- Udacity Simulator: Simplified physics, designed for educational experiments with center/left/right camera streams.
- AirSimNH Simulator: Photorealistic environment with detailed sensor models (LiDAR, radar) and ROS/Unreal Engine integration.
- Udacity Dataset: 4,053 raw image samples (center camera), recorded at 10 Hz over varying tracks and lighting conditions.
- Steering Angle Distribution (raw): heavily concentrated around zero steering.
- Balanced Samples: Removed 2,590 samples from the [-0.1, 0.1] steering bin to mitigate bias, resulting in 1,463 samples.
- Post-Augmentation: Applied geometric and photometric transforms to expand to 3,511 training and 878 validation images.
| Transformation Step | Sample Count |
| --- | --- |
| Total raw samples | 4,053 |
| Removed for balancing | 2,590 |
| Remaining after balancing | 1,463 |
| Augmented training samples | 3,511 |
| Augmented validation samples | 878 |
- AirSimNH Dataset: Over 40,000 raw frames captured at 5 Hz across urban and highway scenarios.
- Initial Steering Distribution: heavily skewed to zero (26,012 samples).
- Balanced via Bin Capping: Limited the zero-angle bin to 15,000 samples, retaining all other bins in full (a capping sketch follows the table below).
- Final Dataset: ~36,695 frames before augmentation.
| Steering Angle | Initial Count | After Capping |
| --- | --- | --- |
| -0.5 | 1,124 | 1,124 |
| 0.0 | 26,012 | 15,000 |
| 0.5 | 7,571 | 7,571 |
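Both datasets were balanced with the same bin-capping idea: discretize steering angles into bins and randomly downsample any bin that exceeds a cap. A minimal sketch, assuming the driving log is a pandas DataFrame with a `steering` column (the column name, bin count, and cap are illustrative defaults, not the project's exact configuration):

```python
import numpy as np
import pandas as pd

def cap_steering_bins(log: pd.DataFrame, num_bins: int = 15,
                      cap: int = 15_000, seed: int = 42) -> pd.DataFrame:
    """Discretize steering angles into uniform bins and cap each bin's size.

    Over-represented bins (typically the near-zero bin) are randomly
    downsampled to `cap`; smaller bins are kept in full.
    """
    edges = np.linspace(log["steering"].min(), log["steering"].max(), num_bins + 1)
    bin_ids = np.digitize(log["steering"], edges[1:-1])  # bin index per sample
    rng = np.random.default_rng(seed)

    keep = []
    for b in np.unique(bin_ids):
        idx = log.index[bin_ids == b].to_numpy()
        if len(idx) > cap:
            idx = rng.choice(idx, size=cap, replace=False)  # drop the excess
        keep.append(idx)

    keep = np.concatenate(keep)
    return log.loc[keep].sample(frac=1.0, random_state=seed).reset_index(drop=True)

# Example: balanced = cap_steering_bins(pd.read_csv("airsim_log.csv"), cap=15_000)
```

The same routine with a much smaller cap on the near-zero bin reproduces the Udacity balancing step described above.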
- Frame Extraction: Loaded raw PNG/JPG images and resized them to 100×320 resolution.
- Image Cropping: Removed the top 50 pixels (sky and vehicle hood) to focus on the roadway.
- Color Space Conversion: Converted RGB to YUV for improved lighting invariance.
- Normalization: Scaled pixel values to [0, 1] and standardized per-channel mean and std.
- Histogram Equalization: Applied CLAHE on the Y channel to enhance contrast.
- Steering Binning: Discretized continuous steering angles into 15 uniform bins for balancing.
- Data Augmentation: Applied in the training pipeline:
- Random zoom (0.8–1.2×)
- Random horizontal shift (±50 px) and vertical shift (±10 px)
- Brightness adjustment (±20%)
- Gaussian noise injection (σ=0.01)
- Random horizontal flips (steering angle negated)
- Dataset Splitting: 80/20 train/validation split, stratified by steering bins to preserve the distribution (a preprocessing and augmentation sketch follows this list).
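A minimal sketch of the preprocessing and augmentation steps using OpenCV and NumPy. The exact crop/resize order, CLAHE parameters, and zoom/shift implementation details are assumptions; frames read with `cv2.imread` arrive in BGR, hence the BGR→YUV conversion, and the per-channel standardization step is omitted for brevity:

```python
import cv2
import numpy as np

def preprocess(img_bgr: np.ndarray) -> np.ndarray:
    """Crop, resize, convert to YUV, equalize contrast (CLAHE on Y), and normalize."""
    img = img_bgr[50:, :, :]                    # drop sky / hood (top 50 px)
    img = cv2.resize(img, (320, 100))           # target resolution 100x320 (h x w)
    yuv = cv2.cvtColor(img, cv2.COLOR_BGR2YUV)  # lighting-invariant color space
    clahe = cv2.createCLAHE(clipLimit=2.0, tileGridSize=(8, 8))
    yuv[:, :, 0] = clahe.apply(yuv[:, :, 0])    # contrast enhancement on Y only
    return yuv.astype(np.float32) / 255.0       # scale to [0, 1]

def augment(img: np.ndarray, steering: float, rng: np.random.Generator):
    """Random zoom, shift, brightness jitter, Gaussian noise, and horizontal flip."""
    h, w = img.shape[:2]

    # Random zoom (0.8-1.2x) around the image center.
    zoom = rng.uniform(0.8, 1.2)
    M = cv2.getRotationMatrix2D((w / 2, h / 2), 0, zoom)
    img = cv2.warpAffine(img, M, (w, h))

    # Random horizontal (+/-50 px) and vertical (+/-10 px) shift.
    tx, ty = rng.uniform(-50, 50), rng.uniform(-10, 10)
    M = np.float32([[1, 0, tx], [0, 1, ty]])
    img = cv2.warpAffine(img, M, (w, h))

    # Brightness adjustment (+/-20%) and Gaussian noise (sigma = 0.01).
    img = np.clip(img * rng.uniform(0.8, 1.2) + rng.normal(0, 0.01, img.shape), 0, 1)

    # Random horizontal flip with negated steering angle.
    if rng.random() < 0.5:
        img, steering = img[:, ::-1, :], -steering
    return img.astype(np.float32), steering
```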
- Object Detection (YOLOv5): A YOLOv5 model, pretrained on COCO and fine-tuned on simulator data, detects vehicles, pedestrians, and static obstacles. Detections include bounding-box coordinates and confidence scores.
- Lane Detection: Canny edge detection on grayscale images followed by a Hough line transform to extract lane-boundary segments; post-processing merges collinear lines and fits lane polynomials (see the first sketch after this list).
- Triggering Logic: If YOLOv5 detects any object whose bounding box enters the predefined collision zone (distance < 5 m, central field of view), issue an immediate brake command.
- Brake Command: Throttle set to zero; optional handbrake flag in simulator APIs to simulate full stop.
- Combined Strategy: Upon obstacle detection in lateral proximity (<2 m from the vehicle centerline), throttle is reduced by 50% and the steering angle is adjusted away from the obstacle.
- Steering Adjustment: Compute the obstacle centroid in the image frame and map it to a steering offset via the linear mapping Δθ = k·(x_img − x_center), with k calibrated in the simulator (see the second sketch after this list).
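A minimal sketch of the Canny + probabilistic Hough stage (thresholds and the region-of-interest polygon are illustrative; the collinear-line merging and polynomial fitting are omitted):

```python
import cv2
import numpy as np

def detect_lane_segments(frame_bgr: np.ndarray) -> np.ndarray:
    """Return candidate lane-line segments as (x1, y1, x2, y2) rows."""
    gray = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2GRAY)
    blur = cv2.GaussianBlur(gray, (5, 5), 0)          # suppress sensor noise
    edges = cv2.Canny(blur, 50, 150)                  # gradient-based edge map

    # Keep only a trapezoidal region of interest in front of the vehicle.
    h, w = edges.shape
    roi = np.zeros_like(edges)
    polygon = np.array([[(0, h), (w, h), (int(0.6 * w), int(0.6 * h)),
                         (int(0.4 * w), int(0.6 * h))]], dtype=np.int32)
    cv2.fillPoly(roi, polygon, 255)
    edges = cv2.bitwise_and(edges, roi)

    # Probabilistic Hough transform: extract straight line segments.
    lines = cv2.HoughLinesP(edges, rho=2, theta=np.pi / 180, threshold=50,
                            minLineLength=40, maxLineGap=100)
    return np.empty((0, 4), dtype=np.int32) if lines is None else lines.reshape(-1, 4)
```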
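The triggering, braking, and avoidance rules reduce to a small override applied on top of the network's outputs. A sketch under an assumed detection schema (the `distance_m`, `lateral_offset_m`, `in_central_fov`, and `x_img` fields are hypothetical; YOLOv5 detections must first be mapped into this form):

```python
def safety_override(detections, steering, throttle, x_center=160.0, k=0.002):
    """Apply emergency braking / obstacle avoidance on top of model outputs.

    Each detection is a dict with hypothetical keys: distance_m,
    lateral_offset_m, in_central_fov, and x_img (bounding-box centroid column).
    """
    for det in detections:
        # Hard brake: object inside the collision zone (< 5 m, central field of view).
        if det["distance_m"] < 5.0 and det["in_central_fov"]:
            return steering, 0.0  # throttle to zero; simulator handbrake optional
        # Soft avoidance: object within 2 m of the vehicle centerline.
        if abs(det["lateral_offset_m"]) < 2.0:
            throttle *= 0.5  # reduce throttle by 50%
            steering += k * (det["x_img"] - x_center)  # delta = k*(x_img - x_center); sign of k steers away
    return steering, throttle
```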
The following model variants were evaluated:
- CNN-Udacity (Steering Only)
- Autoencoder Baseline (Udacity)
- CNN-AirSimNH (Steering Only)
- CNN-AirSimNH (Steering + Throttle)
- Vision Transformer (ViT) Steering Only
- Vision Transformer (ViT) Steering + Throttle
Detailed performance results are given in the results tables below.
- Loss Function: L = α·MAE_steer + (1−α)·MAE_throttle (see the first sketch after this list).
- Training: Joint backpropagation optimizes shared backbone and distinct heads, using AdamW optimizer and learning rates tuned per model.
- Exponential Smoothing: ŷ_t = β·y_t + (1−β)·ŷ_{t−1}, with β = 0.2 to smooth control actions (see the second sketch after this list).
- Throttle Modulation: In high-curvature segments (curvature >0.01 m⁻¹), throttle reduced by 30% to maintain stability.
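A minimal PyTorch sketch of the shared-backbone / two-head arrangement and the weighted MAE loss. The backbone, layer widths, α, and learning rate are illustrative placeholders, not the trained CNN/ViT architectures:

```python
import torch
import torch.nn as nn

class MultiTaskDrivingNet(nn.Module):
    """Shared convolutional backbone with separate steering and throttle heads."""

    def __init__(self):
        super().__init__()
        self.backbone = nn.Sequential(          # stand-in for the real CNN/ViT backbone
            nn.Conv2d(3, 24, 5, stride=2), nn.ELU(),
            nn.Conv2d(24, 36, 5, stride=2), nn.ELU(),
            nn.Conv2d(36, 48, 5, stride=2), nn.ELU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        self.steer_head = nn.Sequential(nn.Linear(48, 50), nn.ELU(), nn.Linear(50, 1))
        self.throttle_head = nn.Sequential(nn.Linear(48, 50), nn.ELU(), nn.Linear(50, 1))

    def forward(self, x):
        feats = self.backbone(x)
        return self.steer_head(feats).squeeze(-1), self.throttle_head(feats).squeeze(-1)

def multitask_loss(steer_pred, steer_true, throttle_pred, throttle_true, alpha=0.7):
    """L = alpha * MAE_steer + (1 - alpha) * MAE_throttle (alpha value is an assumption)."""
    mae = nn.functional.l1_loss
    return alpha * mae(steer_pred, steer_true) + (1 - alpha) * mae(throttle_pred, throttle_true)

# Joint training step (learning rate illustrative):
# model = MultiTaskDrivingNet()
# opt = torch.optim.AdamW(model.parameters(), lr=1e-4)
# steer_pred, throttle_pred = model(images)
# loss = multitask_loss(steer_pred, steer_true, throttle_pred, throttle_true)
# loss.backward(); opt.step(); opt.zero_grad()
```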
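The inference-time post-processing is only a few lines. A sketch assuming the road curvature is estimated elsewhere (e.g., from the fitted lane polynomials):

```python
def smooth_controls(steer_raw, throttle_raw, prev_steer_smoothed,
                    curvature, beta=0.2, curvature_threshold=0.01):
    """Exponentially smooth steering and cut throttle by 30% in sharp curves.

    Implements y_hat_t = beta * y_t + (1 - beta) * y_hat_{t-1};
    curvature is in 1/m and assumed to come from the lane-fitting module.
    """
    steer = beta * steer_raw + (1 - beta) * prev_steer_smoothed
    throttle = throttle_raw * (0.7 if curvature > curvature_threshold else 1.0)
    return steer, throttle
```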
Performance metrics include Mean Absolute Error (MAE) for steering and throttle.
| Model | Steering Validation Loss (MAE) | Throttle Validation Loss (MAE) |
| --- | --- | --- |
| CNN-Udacity | 0.035 | — |
| Autoencoder Baseline (Udacity) | 0.02 | — |
| CNN-AirSimNH (steering only) | 0.009 | — |
| CNN-AirSimNH (multi-task) | 0.033 | 0.25 |
| ViT-AirSimNH (steering only) | 0.00001 | — |
| ViT-AirSimNH (multi-task) | 0.005 | 0.009 |
- Streamlit App: Live demo hosted here.
This study demonstrates the efficacy of deep learning for lateral control with integrated perception and safety modules. ViT-based models achieved the lowest MAE and collision rates, at the expense of higher parameter counts. Future directions include real-world hardware-in-the-loop testing, reinforcement learning for closed-loop adaptation, and sensor fusion with LiDAR.
- Jocher, G. et al. "YOLOv5." Ultralytics, 2020. https://github.com/ultralytics/yolov5
- Dosovitskiy, A. et al. "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale." ICLR 2021.
- Code repository: https://github.com/vishvaspatel/Lateral-Control-for-Autonomous-Vehicles-Self-Driving-Car-
This project is released under the MIT License. See LICENSE for details.