Legistify, a software company designed to simplify legal operations for enterprises, is seeking an experienced Python Developer with a strong focus on web scraping to join their team in Gurugram, India. This role offers an exciting opportunity to work on challenging web scraping projects in the growing legal tech space.
As a Python Developer (Web Scraping Specialist), you will be responsible for extracting and processing data from websites using Python-based scraping tools and libraries. This position is ideal for a detail-oriented individual with expertise in building robust web scrapers and handling large data sets.
Key responsibilities include:
- Developing, maintaining, and optimizing Python-based web scrapers to extract data from various websites efficiently
- Scraping and processing structured and unstructured data from multiple sources, ensuring accuracy and completeness
- Designing scripts to automate repetitive scraping tasks and schedule jobs using tools like cron, Airflow, or Celery
- Storing and managing scraped data in databases (SQL/NoSQL) or cloud storage solutions
- Implementing error-handling strategies to deal with issues like captcha, IP blocking, and dynamic content loading
- Ensuring scrapers are optimized for performance and can handle large-scale scraping without crashing or slowing down
- Adhering to web scraping best practices and ensuring compliance with legal standards
- Collaborating with data analysts, product managers, and other developers to understand data requirements and deliver high-quality results
The ideal candidate should have:
- Strong expertise in Python, with specific experience in libraries like BeautifulSoup, Scrapy, Selenium, and Requests
- Familiarity with web scraping tools and techniques for extracting data from dynamic and static websites
- Experience working with RESTful APIs and parsing various data formats
- Knowledge of databases such as MySQL, PostgreSQL, or MongoDB for storing and processing scraped data
- Problem-solving skills to tackle challenges like CAPTCHA, proxies, and dynamic content
- Experience with Git for version control and collaboration
- Familiarity with performance optimization techniques, including multithreading and asynchronous scraping
This role offers a competitive salary range of ₹60,000 to ₹750,000 INR, depending on experience and skills. Join Legistify and be part of a team that's revolutionizing legal operations for enterprises through innovative software solutions.