DEPTH PERCEPTION USING STEREO VISION

Dense depth estimation of the surroundings is required by intelligent robots, drones and autonomous vehicles for applications such as navigation, sensing and 3D modelling of the environment, path planning, obstacle avoidance and surveillance. Stereo-vision-based depth perception is a stereoscopic ranging technique that estimates the 3D profile of a scene from a 2D stereo image pair. Depth is perceived by computing the disparity from correspondences between the two images of the pair, which is computationally intensive, so achieving it in real time is challenging. In this project, the census transform and sum of absolute differences (SAD) algorithms are run on various platforms to evaluate their accuracy and explore their deployability on a Zedboard FPGA.
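Disparity and depth are inversely related. As a minimal sketch, assuming a rectified image pair from two identical pinhole cameras, depth Z follows from disparity d as Z = f * B / d, where f is the focal length in pixels and B is the baseline between the cameras. The function name and the calibration values below are hypothetical and are not taken from this repository.

```python
def disparity_to_depth(disparity_px, focal_length_px, baseline_m):
    """Depth in metres for one pixel: Z = f * B / d (rectified pinhole stereo pair)."""
    if disparity_px <= 0:
        return float("inf")  # zero disparity: point at infinity or no valid match
    return focal_length_px * baseline_m / disparity_px


# Hypothetical calibration values, for illustration only.
FOCAL_LENGTH_PX = 700.0   # focal length expressed in pixels
BASELINE_M = 0.12         # distance between the two camera centres, in metres

# Larger disparities correspond to closer points.
depths = [disparity_to_depth(d, FOCAL_LENGTH_PX, BASELINE_M) for d in (0, 7, 70)]
```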

Census Transform on Verilog for FPGA

The Verilog code for the depth perception implementation is in the StereoCensus-verilog-impl-Census sub-directory. The left and right images are from the Middlebury Cones dataset. Additional information is provided in the sub-directory.

Census Transform on Python

python_census.py is a Python model of the Census transform stereo algorithm, useful for evaluating accuracy of a given parameter set.

The algorithm for census transform is as follows:

1. For a pixel in the reference (left) image, find its census vector:
   a. Take a window around the pixel.
   b. Compare every pixel in the window to the center pixel. Assign it 1 if its intensity is greater than that of the center pixel, or 0 if it is lower.
   c. Build a vector of 0s and 1s from these assignments.
2. Calculate the census vectors for the corresponding pixels in the right image. The number of pixels for which the census vector is calculated equals the search range chosen by the user.
3. Compute the Hamming distances between the census vectors in the right image and the census vector in the left image.
4. The index of the window whose census vector has the minimum Hamming distance is the disparity.
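The steps above can be sketched in plain Python as follows. This is an illustration under stated assumptions, not the code in python_census.py: images are 2D lists of intensities, border pixels are handled by clamping coordinates to the image, a pixel equal to the center is assigned 0, and ties are resolved in favour of the smallest disparity. The function names and default parameters are hypothetical.

```python
def census_vector(img, y, x, r):
    """Census bit vector for pixel (y, x), packed into an int.

    Each neighbour in the (2r+1) x (2r+1) window contributes one bit:
    1 if it is brighter than the centre pixel, 0 otherwise.
    Coordinates outside the image are clamped to the nearest edge pixel.
    """
    h, w = len(img), len(img[0])
    centre = img[y][x]
    bits = 0
    for dy in range(-r, r + 1):
        for dx in range(-r, r + 1):
            if dy == 0 and dx == 0:
                continue  # the centre pixel is not compared with itself
            yy = min(max(y + dy, 0), h - 1)
            xx = min(max(x + dx, 0), w - 1)
            bits = (bits << 1) | int(img[yy][xx] > centre)
    return bits


def hamming(a, b):
    """Number of bit positions in which two packed census vectors differ."""
    return bin(a ^ b).count("1")


def census_disparity(left, right, window=5, search_range=16):
    """Disparity map: for each left pixel, the shift d with the smallest Hamming distance."""
    r = window // 2
    h, w = len(left), len(left[0])
    cl = [[census_vector(left, y, x, r) for x in range(w)] for y in range(h)]
    cr = [[census_vector(right, y, x, r) for x in range(w)] for y in range(h)]
    disparity = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Candidate matches lie at x - d in the right image; keep them inside the image.
            candidates = range(min(search_range, x + 1))
            disparity[y][x] = min(candidates, key=lambda d: hamming(cl[y][x], cr[y][x - d]))
    return disparity
```

Packing the census vector into an integer keeps the Hamming distance to one XOR followed by a bit count, which is also how the comparison is typically laid out in hardware.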

To run:

python_census.py -l img_left.png -r img_right.png -o img_out.png

It runs under Python 3 with NumPy and Pillow. Image formatting is handled by PIL.

Sum of absolute differences on C

pgmIO.cpp reads a pair of stereo images in PGM format and stores the pixel data in a linear array for further processing.
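To illustrate what reading a PGM image into a linear array involves, here is a minimal Python sketch. It is not a port of pgmIO.cpp: the function name is hypothetical, and it assumes a binary (P5) file with 8-bit pixels and no comment lines in the header.

```python
import re


def read_pgm(path):
    """Read an 8-bit binary PGM (P5) file; return (width, height, pixels).

    `pixels` is a flat bytes object in row-major order, so the pixel at
    row y, column x is pixels[y * width + x].
    """
    with open(path, "rb") as f:
        data = f.read()
    # Header: magic "P5", width, height, maxval, then one whitespace byte.
    match = re.match(rb"P5\s+(\d+)\s+(\d+)\s+(\d+)\s", data)
    if match is None:
        raise ValueError("expected a binary PGM (P5) header without comments")
    width, height, maxval = (int(group) for group in match.groups())
    if maxval > 255:
        raise ValueError("this sketch only handles 8-bit images")
    pixels = data[match.end():match.end() + width * height]
    if len(pixels) != width * height:
        raise ValueError("pixel data is truncated")
    return width, height, pixels
```

Keeping the image as one flat buffer means a row or window can be addressed with simple index arithmetic, which mirrors how a linear array would be used in C.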

stereoGold.cpp is the primary code for the stereo-correspondence computation. The images are padded with a border of zeroes in order to minimise loss of data at the edges. After padding, the search range can be set in the code (it must be between 0 and 255). With every pass, the target image is shifted to the right and the SAD is calculated. The shift that gives the minimum SAD value, accumulated over the window around the center pixel, is stored as the disparity. Histogram equalization is applied before the array is output to reconstruct the disparity image.
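The padding, shifting and minimum-SAD selection described above can be sketched in plain Python as follows. This is an illustration, not a port of the C++ code: the function names, window size and default search range are hypothetical, images are 2D lists of integer intensities, and the equalization step uses the standard cumulative-histogram formula, which may differ from what the repository does.

```python
def pad_zero(img, r):
    """Surround a 2D list of ints with a border of r zero pixels."""
    w = len(img[0])
    blank = [0] * (w + 2 * r)
    top = [blank[:] for _ in range(r)]
    bottom = [blank[:] for _ in range(r)]
    return top + [[0] * r + list(row) + [0] * r for row in img] + bottom


def sad_disparity(left, right, window=5, search_range=16):
    """Per pixel, pick the shift d with the smallest window SAD between left and right."""
    r = window // 2
    h, w = len(left), len(left[0])
    pl, pr = pad_zero(left, r), pad_zero(right, r)
    disparity = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            best_cost, best_d = None, 0
            # Shifting the right image by d aligns its column x - d with left column x.
            for d in range(min(search_range, x + 1)):
                cost = 0
                for dy in range(window):
                    row_l, row_r = pl[y + dy], pr[y + dy]
                    for dx in range(window):
                        cost += abs(row_l[x + dx] - row_r[x + dx - d])
                if best_cost is None or cost < best_cost:
                    best_cost, best_d = cost, d
            disparity[y][x] = best_d
    return disparity


def equalize(disparity, levels=256):
    """Histogram-equalize a disparity map so it spans the full 0..levels-1 range."""
    flat = [v for row in disparity for v in row]
    hist = [0] * levels
    for v in flat:
        hist[v] += 1
    cdf, total = [], 0
    for count in hist:
        total += count
        cdf.append(total)
    cdf_min = next(c for c in cdf if c > 0)
    denom = max(len(flat) - cdf_min, 1)  # avoid division by zero for a constant map
    return [[round((cdf[v] - cdf_min) * (levels - 1) / denom) for v in row]
            for row in disparity]
```

Equalization matters here because raw disparities usually occupy only a small part of the 0 to 255 range, so an unscaled disparity image would look almost uniformly dark.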

To run:

gcc main.cpp pgmIO.cpp stereo_gold.cpp -o executable.o


adb1997/Depth_Estimation_Project
