GitHub - Krishna9588/Data_Scraping: Selenium - Based data scraping for companies contact data.

Problem Statement

1. Contact Data Scraping

Scrape publicly available company data from their websites — focus only on what’s reliably accessible.

Field	Status	Notes
Website URL	Required	Use Google Search fallback if not provided
General Contact Email	Required	Usually in “Contact Us” or page footer
Phone Number	Required	Found in footer or contact page
Location / Address	Required	Available on contact or “About” page
Leadership Names	Optional	Only if listed under “Team” or “Leadership”

Output Required:

Company	Website	Email	Phone	Location	Found Leadership Name(s)

2. Technology Detection via Python

For each company (list given), write a Python script to identify whether the website mentions:

Cloud Technologies (e.g., AWS, Azure, GCP) MES Technologies (e.g., Siemens Opcenter, Rockwell FactoryTalk) PLM Technologies (e.g., Teamcenter, Windchill, ENOVIA)

Output Required:

Company	Website	Cloud Tool Mentioned	Source URL	MES Tool Mentioned	Source URL	PLM Tool Mentioned	Source URL
Company A	https://www.companya.com	AWS	www.companya.com/tech	Siemens Opcenter	www.companya.com/tech	-	www.companya.com/tech
Company B	https://www.companyb.com	GCP	www.companyb.com/products	-	www.companyb.com/products	Teamcenter	www.companyb.com/products

To use the Data Scraping Script for extracting details

Either use Function7 or Function9

error_correction. Might not work well as some problems are occurring after navigate_to_contact_page() - function

Test on an unknown website, other than the 10 provided.

Script successfully fetched information from ifm's website, Including Contact number and Email Address, but got stuck collecting address details. Result

Run below mentioned scripts for better results.

    function7.py

    function9.py

To convert an Excel file to .csv, use this

    convert.py

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.idea		.idea
dataset		dataset
output		output
Problem-1.py		Problem-1.py
Problem-2.py		Problem-2.py
Readme.md		Readme.md
Test_Result.png		Test_Result.png
contact_data _P1.csv		contact_data _P1.csv
convert.py		convert.py
error_correction.py		error_correction.py
function7.py		function7.py
function9.py		function9.py
problem_statement.md		problem_statement.md
requirements.txt		requirements.txt
technology_detection _P2.csv		technology_detection _P2.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Problem Statement

1. Contact Data Scraping

2. Technology Detection via Python

To use the Data Scraping Script for extracting details

Test on an unknown website, other than the 10 provided.

Run below mentioned scripts for better results.

To convert an Excel file to .csv, use this

About

Uh oh!

Releases

Packages

Languages

Krishna9588/Data_Scraping

Folders and files

Latest commit

History

Repository files navigation

Problem Statement

1. Contact Data Scraping

2. Technology Detection via Python

To use the Data Scraping Script for extracting details

Test on an unknown website, other than the 10 provided.

Run below mentioned scripts for better results.

To convert an Excel file to .csv, use this

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages