Python Developer
Job Description
Job Title: Python NLPLocation: Mumbai (Hybrid)
Experience :3 to 6 years
Employment Type: Full-Time
Job Overview
Key Responsibilities:
Develop and implement Python-based solutions for extracting and processing data from PDFs.
Utilize NLP libraries (such as spaCy, NLTK, or similar) to analyze and structure text data.
Optimize text extraction processes for accuracy and efficiency.
Work with structured and unstructured data formats to transform extracted data into usable insights.
Debug and resolve issues related to text parsing and extraction.
Collaborate with cross-functional teams to refine data processing workflows.
Mandatory Skills:
Strong proficiency in Python programming.
Hands-on experience with NLP libraries like spaCy, NLTK, TextBlob, or Transformers.
Experience in extracting text from PDFs using tools like PyMuPDF, PDFMiner, or Tesseract OCR.
Understanding of regular expressions (RegEx) for text pattern matching.
Familiarity with data processing and text cleaning techniques.
Preferred Skills:
Knowledge of machine learning techniques for text classification and entity recognition.
Experience with document OCR tools such as Tesseract or Amazon Textract.
Familiarity with data storage solutions like SQL, NoSQL, or Pandas DataFrames.
Exposure to cloud-based NLP services (Google NLP, AWS Comprehend, etc.).