Senior AI Engineer, Unstructured AI

Joining Collibra’s Unstructured AI Team

  • Work at the forefront of context engineering - shaping how AI systems retrieve, structure, and leverage context to deliver accurate, high-quality results at scale.
  • Own end-to-end technical delivery of Unstructured AI systems - from feature prototype to stable production across enterprise environments.
  • Build and scale full-stack systems that ingest, process, and enrich large volumes of unstructured content from distributed enterprise silos (PDFs, contracts, reports, and other document types).
  • Collaborate with the Best: Work closely with xYC Founders to understand complex business challenges and deliver Deasy to solve them. Be part of a dynamic team where ideas flow freely and creativity thrives.
  • Learn and Lead: Stay ahead of the curve by engaging with the latest developments in machine learning and AI. Share knowledge and lead by example to maintain high building standards.

This is a hybrid role based in our New York office. Our hybrid model means you’ll work from the office at least two days each week. This setup helps us stay connected, work more closely together, and keep making progress as a team.

Senior AI Engineers at Collibra are responsible for

  • Shipping complex systems under ambiguity - balancing speed and precision in real environments.
  • Writing and reviewing production-grade backend code (Python, FastAPI).
  • Building/deploying document-processing systems that handle large-scale, unstructured data environments.
  • Integrating data from diverse enterprise data sources (e.g., SharePoint, Salesforce, or internal APIs) to provide context for AI features.
  • Partnering across engineering, product, and sales teams, ensuring alignment from prototype to rollout.
  • Occasionally working with modern frontend development.

You have

  • Strong proficiency in Python (data processing, API development, and integrations).
  • Hands-on work with LLM-based and AI-driven enrichment models (e.g., classification, entity extraction, deduplication, PII detection).
  • Proven ability to deliver production-grade systems using Big Data frameworks (e.g., Spark) to handle data at scale.
  • Solid understanding of data pipelines, microservice architecture, and API design.
  • Experience ingesting and processing data from third-party enterprise sources (e.g., SharePoint/OneDrive, Salesforce, and SaaS-based knowledge bases).
  • Strong communication skills across technical and business teams.
  • Calm, structured decision-making under tight timelines or ambiguity.
  • Familiarity with metadata systems, data cataloging, or document AI workflows.
  • Knowledge of model evaluation best practices.
  • Experience with search relevance.
  • A bachelor’s degree or equivalent related working experience is required.
  • This position is not eligible for visa sponsorship.

You Are

  • Calm and structured in your decision-making under tight timelines or ambiguity.
  • Capable of communicating clearly across engineering, product, and field teams, ensuring alignment from prototype to rollout.
  • Experienced in spotting risks early, course-correcting without friction, and modeling composure when delivery timelines are tight.
  • Someone who cares deeply about data quality, precision, and governance.
  • A strong communicator with stakeholder-management skills across technical and business teams.

Measures of Success

  • Within your first month, you will develop a deep understanding of our product vision and the unstructured data stack that powers it, shipping your first set of end-to-end features.
  • Within your third month, you will take full ownership of technical delivery for key product areas, building robust capabilities that handle complex document processing and ensure a stable, high-performance experience for users interacting with diverse data sources.
  • Within your sixth month, you will drive the development of ambitious, enterprise-grade AI product features that solve for data at scale, architecting the high-performance pipelines and advanced context engineering required to deliver accurate, reliable results.

Compensation for this role

The standard base salary range for this position is $204,000 - $255,000 per year. This position is not eligible for additional commission-based compensation. Salary offers are based on a combination of factors, including, but not limited to, experience, skills, and location.

In addition to base salary, we offer a competitive total rewards package, including bonus potential, equity for eligible roles, a Flex Fund monthly stipend, pension/401k plans, and more.

Benefits at Collibra

Collibra recognizes and values that everyone has different needs, interests, and life goals. We built our benefits program with flexibility in mind to support you and your loved ones through a diverse range of circumstances and life events. These flexible offerings sit on a foundation of competitive compensation, health coverage, and time off. Learn more about Collibra’s benefits.

We create inclusion and belonging through how we onboard, meet, connect, engage, and communicate. Learn more about diversity, equity, and inclusion at Collibra.

At Collibra, we’re proud to be an equal opportunity employer. We realize the key to creating a company with a world-class culture and employee experience comes from who we hire and creating a workplace that celebrates everyone.

With this, we proudly consider qualified applicants without regard to race, color, religion, creed, gender, national origin, age, disability, veteran status, sexual orientation, pregnancy, sex, gender identity, gender expression, genetic information, physical or mental disability, HIV status, registered domestic partner status, caregiver status, marital status, veteran or military status, citizenship status or any other legally protected category. If you have a need that requires accommodation, let us know by completing our Accommodations for Applicants form.
