Kavya071/AIProjectCollector

🤖 AI Project Collector

AI Project Collector is a Python-based web scraping tool that uses Selenium and BeautifulSoup to collect AI-related project listings from Freelancer.com. It cleans, structures, and exports the data into an Excel file, making it easy for developers and researchers to explore trends in the freelance AI job market.


📁 Project Structure

  • s.py: Main Python script that handles scraping, cleaning, and exporting data.
  • projects_data_cleaned.xlsx: Output Excel file containing structured data of scraped AI projects.

🛠️ Technologies Used

  • Python: Primary language for scripting.
  • Selenium: Automates browser interaction for dynamic web scraping.
  • BeautifulSoup: Parses and extracts data from HTML.
  • Pandas: Organizes and exports data into Excel format.
  • openpyxl: Engine used by Pandas to write Excel files.

🚀 Getting Started

1. Clone the Repository

git clone https://github.com/Kavya071/AIProjectCollector.git
cd AIProjectCollector

2. Install Dependencies

pip install pandas selenium beautifulsoup4 openpyxl

3. Setup ChromeDriver

  • Download ChromeDriver compatible with your Chrome version.
  • Update the path in s.py here:
service = Service('path/to/chromedriver')
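A minimal sketch of how that path might be wired into Selenium 4, assuming the `Service`-based constructor that the line above suggests. The helper names `check_driver_path` and `make_driver` are hypothetical and are not taken from s.py; the headless flag is an optional extra and may need adjusting for older Chrome versions.

```python
from pathlib import Path


def check_driver_path(path: str) -> Path:
    """Return the ChromeDriver path, or raise if no file exists there."""
    driver_path = Path(path).expanduser()
    if not driver_path.is_file():
        raise FileNotFoundError(f"ChromeDriver not found at {driver_path}")
    return driver_path


def make_driver(chromedriver_path: str, headless: bool = True):
    """Build a Chrome WebDriver from an explicit ChromeDriver path."""
    # Imported lazily so the path check above works without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.service import Service

    options = webdriver.ChromeOptions()
    if headless:
        # Assumption: a recent Chrome that understands the new headless mode.
        options.add_argument("--headless=new")
    service = Service(executable_path=str(check_driver_path(chromedriver_path)))
    return webdriver.Chrome(service=service, options=options)
```

Calling `make_driver('path/to/chromedriver')` fails early with a readable error if the path is wrong, which is the most common setup problem at this step.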

4. Run the Script

python s.py

The script will scrape AI job listings from pages 1 to 40 on Freelancer.com and save the data to projects_data_cleaned.xlsx.
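The page range described above could be driven by a loop along these lines. This is a sketch only: the URL template is a placeholder (the address s.py actually requests is not shown here), and `fetch_page` and `parse_page` are hypothetical callables standing in for the Selenium and BeautifulSoup steps.

```python
from typing import Callable, Iterable, List

# Placeholder template: substitute the listing URL that s.py really uses.
URL_TEMPLATE = "https://example.com/jobs/{page}"
FIRST_PAGE, LAST_PAGE = 1, 40


def page_urls(first: int = FIRST_PAGE, last: int = LAST_PAGE,
              template: str = URL_TEMPLATE) -> List[str]:
    """Build one URL per listing page, inclusive of both ends."""
    return [template.format(page=n) for n in range(first, last + 1)]


def collect(urls: Iterable[str],
            fetch_page: Callable[[str], str],
            parse_page: Callable[[str], List[dict]]) -> List[dict]:
    """Fetch and parse each page, skipping pages that fail to load."""
    rows: List[dict] = []
    for url in urls:
        try:
            html = fetch_page(url)
        except Exception as exc:  # one bad page should not end the whole run
            print(f"Skipping {url}: {exc}")
            continue
        rows.extend(parse_page(html))
    return rows
```

Passing the fetch and parse steps in as functions keeps the loop easy to try on two or three pages before committing to all 40.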


📊 Features

  • Scrapes AI-related freelance projects across multiple pages.
  • Extracts title, average bid, skills, bid count, and description.
  • Cleans illegal/unprintable characters.
  • Exports clean structured data to an Excel file.
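The cleaning step listed above might look something like the following sketch. It assumes the "illegal" characters are the ASCII control characters that Excel workbooks cannot store; the function names `clean_text` and `clean_record` and the sample field names are hypothetical, not taken from s.py.

```python
import re

# Control characters that cannot be written to an .xlsx cell.
# Tab (\x09), newline (\x0a) and carriage return (\x0d) are legal and kept.
ILLEGAL_CHARS = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f]")


def clean_text(value):
    """Strip illegal control characters and outer whitespace from strings."""
    if not isinstance(value, str):
        return value  # leave numbers, None, etc. untouched
    return ILLEGAL_CHARS.sub("", value).strip()


def clean_record(record: dict) -> dict:
    """Apply clean_text to every field of one scraped project."""
    return {key: clean_text(val) for key, val in record.items()}
```

A list of cleaned records can then be handed to `pandas.DataFrame(records).to_excel("projects_data_cleaned.xlsx", index=False)`, which uses openpyxl to write the file.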

📧 Contact

For queries or suggestions:

