Google Cloud and Collibra deepen their partnership to bring business context and semantics directly to the Dataplex Universal Catalog
The speed of innovation in the data space isn’t just accelerating; it’s hit warp speed. As organizations race to harness the power of AI, they face a recurring roadblock: a fragmented data estate. To move fast and stay competitive, teams need to unify governance, analytics and AI under one roof.
Today, we are thrilled to announce a significant expansion of the strategic partnership between Google Cloud Dataplex and Collibra. This collaboration centers on a singular mission: integrating robust data governance, business context and deep semantics directly into the heart of the modern open lakehouse. By combining Collibra’s governance leadership with the power of Google Dataplex, we are providing joint customers with a unified, trusted data experience across their entire Google Cloud ecosystem.
Why context is the currency of trust
In today’s data-driven landscape, raw tables, rows and columns are no longer sufficient. To drive reliable decision-making, data must be anchored in essential business context. Users need to know: Who owns this data? What does this field truly mean? Is this data trusted? When these questions go unanswered, everyone from data scientists to business analysts wastes time hunting for clarity, leading to widespread mistrust and stalled insights.
Collibra provides the industry-leading system of record to solve this, offering the framework where organizations define and manage their most critical data policies, glossaries and additional business semantics.
A cornerstone of our expanded partnership is a new bi-directional integration that allows this comprehensive business context to flow directly into Google Cloud’s intelligent data fabric, Dataplex. This capability ensures that the comprehensive business context and governance policies defined in Collibra are natively accessible within the Dataplex environment, providing a single, consistent view of governed data across the entire data estate. This deep integration streamlines operations and ensures consistency by:
- Automated Discovery (Inbound): Technical metadata and discovery insights from Dataplex are fed back into Collibra, ensuring the enterprise system of record is continuously updated with the latest technical reality of the data landscape.
- Enriching the Fabric (Outbound): Business context and governance policies defined in Collibra flow natively into Dataplex, making governed metadata instantly accessible to users within their existing Google Cloud workflows.
Collibra’s bi-directional integration with Google Dataplex is available in preview for our customers to start using today. More details on the integration and how to configure are available in Collibra’s product documentation.
Fueling smarter AI with a unified Semantic Layer
While business context explains the ownership and origin of your data, semantics defines the precise logic and relationships that give that data its meaning. Semantics define the precise relationships and logic within your data, ensuring that a "Net Profit" calculation in your finance dashboard matches the "Net Profit" used to train your latest ML model.
For organizations transitioning generative AI from experimental pilots to full-scale production, a governed semantic layer serves as the essential framework for reliability and scale:
- Enhanced model accuracy: High-fidelity AI outputs depend on models trained against consistently defined, high-quality data sets.
- Streamlined data preparation: By automating the mapping of disparate data points, this layer effectively solves the "80% problem" of manual data preparation.
- Greater transparency and explainability: When an AI generates an insight, users can trace the underlying logic back through a transparent, governed layer to verify its origin.
Through this deep, interoperable semantic exchange, Google Cloud and Collibra provide a single, "golden" view of data. This ensures that every tool, user and application across the enterprise is operating from the same source of truth.
Governance for the open, hybrid lakehouse
The era of the Open Lakehouse is here. As organizations shift toward open formats like Apache Iceberg, they are increasingly managing workloads across complex hybrid and multi-cloud environments. Our joint solution leverages Google’s support for Iceberg Rest Catalog. working with Collibra txo extend its robust governance controls and visibility across the entire open lakehouse. By providing centralized oversight for data regardless of its physical location, this partnership simplifies the inherent complexity of multi-cloud needs.Ultimately, our collaboration is designed to empower organizations with open data access and flexibility, ensuring they never have to sacrifice control or compliance to achieve agility.
Conclusion: Redefining data intelligence for the AI era
The partnership between Collibra and Google Cloud Dataplex represents a paradigm shift in how enterprises manage their most valuable asset. By bridging the gap between raw data and business meaning, we are empowering our joint customers to operate with unprecedented clarity and speed. Whether it’s providing better context through bi-directional metadata synchronization, fueling smarter AI with a unified semantic layer or ensuring robust governance across the modern open lakehouse, this integration provides the trusted foundation necessary for the AI era.We invite you to experience the future of governed data firsthand. Visit us at Booth #5204 during Google Cloud Next 2026 to see how our connectivity can transform your data ecosystem.
Keep up with the latest from Collibra
I would like to get updates about the latest Collibra content, events and more.
Thanks for signing up
You'll begin receiving educational materials and invitations to network with our community soon.