After the action-packed week that was AWS re:Invent, it’s time to reflect on the lessons we learned and explore how we can take action. Rather than focus on the flurry of news (you can always go to the AWS website to see all of the product announcements), we thought it best to summarize the conversations we had to help coordinate our focus as we head into 2019.
Below are our summaries of some of those topics.
Data Catalogs are a New Focus for Enterprise Architects
For the last few years, there has been an increasing volume of conversations about driving data availability and usage through a catalog, typically within a single business unit or across a small group of passionate data evangelists and scientists. Based on our experience at re:Invent, however, it’s rapidly becoming an enterprise-wide challenge that the Enterprise Architect is tasked with solving. The rationale is pretty simple: meeting the broader challenges of AI/ML or business imperatives like Customer 360 requires aggregating data from across the entire organization. But that aggregation also creates a series of challenges, a message reiterated throughout several speaking sessions at this year’s event.
The first challenge discussed was that any catalog solution needs to be open and supported by strong native integrations with the most relevant enterprise data platforms. It must also be easy to integrate with via a solid API layer that is supported by a wide ecosystem of implementation experts.
Second, automation of ingestion, including profiling and tagging, is paramount (this is the largest chasm to cross when setting up an enterprise catalog). Automation is meant to speed up ingestion and handle the vast volumes of data that need to be added. There were many discussions on how AI/ML is advancing in this area.
The last key challenge was that the catalog needs to include not only data assets, but also all of the follow-on assets, including analytics dashboards, workbooks, and worksheets.
Data Governance Along with Catalogs, Accelerating (Multi-)Cloud Push
While there was a lot of buzz at the show around new ways to use data (Amazon QuickSight (BI), Amazon Forecast (ML), Amazon SageMaker (ML) updates), many of the conversations we had were about how to manage data and accelerate its movement so that it can be used in all of these developing areas. The challenge here is how to provide visibility across the data landscape, which is still a mix of on-premises and cloud (and, increasingly, multi-cloud) environments.
While the value of a data catalog is recognized here as well, it is clear that companies have looked a bit deeper at the problem and the requirements to solve it. Discussions expanded to include how to manage workflows across the key players involved, which include Analysts, Data Engineers, and Data Stewards. Facilitating collaboration between the business and IT organizations was viewed as imperative, since most tools used today are intended only for deep experts and not for the faint of heart. Ultimately, companies are not getting the engagement and usage out of their lake/warehouse that they (or the business) expected.
Another point raised was that with the rise of Kubernetes and Docker, it’s increasingly easy to spin up siloed and highly temporary mini data lakes for ad-hoc data analytics or machine learning workbenches. For these ad-hoc data islands for data scientists, it is of the utmost importance to understand data lineage and data provenance. The fact that applications and data are becoming increasingly “mobile” places more of a focus on lineage. Lineage is viewed as providing not only traceability from analytics down to the source, but also greater visibility into who or what (people, processes, and tools) uses what data. Collibra has seen this visibility lead to better-quality data because it is both proactive and reactive: it uncovers errors before they become a problem, and it helps resolve issues when they arise. This is one factor that improves end users’ trust in the data which, in turn, increases usage.
I am sure these areas will continue to come to the forefront, especially considering that AWS mentioned it had more than 10,000 companies using it to create their data lakes.
Privacy Now a Focus of a CIO and CTO
The last big topic that came out of re:Invent was data privacy. With the advent of new regulations like GDPR and the California Consumer Privacy Act (CCPA), and the volume of press coverage of recent public breaches, organizations are demanding a more systematic approach to data privacy than the ad-hoc way it is handled today. This shift was more pronounced at re:Invent because the CIO and CTO are now the individuals raising the question. The challenge is that companies have not been sitting idle; rather, they have implemented a vast array of point technologies (and some manual processes) across privacy, data protection, architecture, ontologies, policies, etc. Yet there is no system of record across these areas. We see 2019 as being a year where “privacy by design” becomes the model organizations adopt to provide a true record of processing activities and point of integration.
So, what should the focus be as we head into 2019 based on this insight?
Want to continue reading about data catalogs? Download our e-book, A Comprehensive Guide to the Data Catalog, to learn more.