Lead AI/ML Engineer
dataplor
Software Engineering, Data Science
United States
Posted on Oct 11, 2024
Location: Remote or any of our offices in Honolulu, San Francisco, New York, Los Angeles, Denver or Miami
About Us
We are dataplor, a rapidly expanding global location data company. Our mission is to comprehensively map and understand every commercial location on the planet. We are seeking a highly skilled and strategic individual to join our team as a Lead AI/ML Engineer. This role requires a strong comfort level with remote work and global collaboration, as well as a passion for leveraging AI/ML technologies to solve complex data challenges.
Role Overview
The Lead AI/ML Engineer at dataplor must be adept at building, deploying, and managing models, tackling tasks of any scale. This pivotal role will advance our location data products by addressing various data challenges, such as processing global polygons, deduplicating records, and cleaning data from thousands of sources in different languages. The engineer will collaborate cross-functionally with data operations, data scientists, software engineers, and product teams, reporting directly to the CTO. The ideal candidate will demonstrate strategic thinking and innovative solutions to drive success.
Key Responsibilities
Data Processing and Management:
- Develop and implement AI/ML models to process and analyze large-scale location data
- Work on cleaning, deduplicating, and normalizing data from multiple sources
- Enhance our global polygon data to ensure accuracy and consistency
- Build and maintain ML pipelines for deduplication and data standardization
- Utilize NLP techniques to handle multilingual data and improve data extraction and classification
- Develop and fine-tune Large Language Models (LLMs) to improve data quality and processing efficiency
- Collaborate with data scientists, engineers, and product managers to identify and solve data-related challenges
- Stay up to date with the latest advancements in AI/ML and NLP technologies and integrate them into our workflows
- Contribute to the continuous improvement of our data processing frameworks and methodologies
- Play a key role in the development and launch of new products
- Participate in strategic planning sessions and contribute to high-level decision-making processes
- Work closely with leadership to align AI/ML initiatives with company goals and objectives
Qualifications
- Proven experience (5+ years) in AI/ML engineering, building and deploying models, preferably in the location data or geospatial industry
- Demonstrated experience successfully deploying models into production and managing these models for ongoing improvements
- Experience working with large datasets, multiple sources, and developing data processing pipelines
- Proficiency in programming languages such as Ruby or Python
- Strong knowledge of machine learning frameworks (e.g., TensorFlow, PyTorch) and NLP techniques
- Experience with geospatial data processing tools and libraries (e.g., PostGIS, GeoPandas, Shapely)
- Familiarity with cloud platforms (e.g., AWS, GCP) and containerization (e.g., Docker, Kubernetes)
- Excellent problem-solving and analytical skills
- Strong communication skills, with the ability to explain complex concepts to non-technical stakeholders
- Ability to work both independently and collaboratively in a fast-paced, dynamic environment
- Passion for leveraging AI/ML to solve complex data challenges
Benefits
- Competitive salary, health insurance, perks, and equity package
- Flexible working hours and remote work options
- Professional development opportunities
- A collaborative and supportive work environment
- A seat at the table for strategic planning and product development discussions