Hi, I am an experienced and proficient web scraper using Python to obtain very large amounts of data from a variety of online sources, I have developed hundreds of scrapers for wide range of purposes like gathering information from different types of business directories, e-commerce sites and directories, social networks, review sites, event schedules, transport schedules etc. I have been able to retrieve data from articles, tables, lists, recursively via search results, from sites with AJAX/Javascript, and even when authentication is required. Any project you have I would be able to discuss and preview the site(s) which need to be scraped in order to provide you the output you are looking for in .CSV or other format.
- 10M Amazon black t-shirts Images scraping,
- Distil (undetectable) website scraping,
- betting website scraping,
- yellow page scraping,
- FB and Linkedin emails scraping,
- WAF(web application firewall) website scraping,
- Fake website scraping,
- Sub-domains web scraping...
on SaaS and several web platforms using Python (BS4, Selenium, Scrapy).
I have experience in Proxy IP Rotation and bypassing Captcha using OCR and deathbycaptcha & [login to view URL] API, AWS(s3,ec2), Excel Spreadsheet and MySQL.
I am very familiar with selenium for web automating bots, PhantomJS for DOM handling, CSS selector, JSON, Canvas, and SVG.
As a dedicated full time developer, I can work 40+ hours/week in your time-zone.
Looking forward t