Gain full visibility across your data landscape, find meaning in your data and improve the quality of business decisions.
Discover and download solutions and pre-built integrations for the Collibra Platform.
Get unparalleled value through the combined expertise and unique strengths of our people and technology.
See how security plays a key role in everything from how we build and deliver our platform to how we hire and train employees.
Collibra Privacy & Risk
Discover and understand data that matters so you can generate impactful insights that drive business value.
Understand your ever-growing amount of data in a way that scales with growth and change.
Show how data sets are built, aggregated, sourced and used, providing complete, end-to-end lineage visualization.
Build customer trust by operationalizing privacy policies and scaling compliance across new regulations.
Modernize your operations with a solution that is scalable, accessible and resilient: data in the cloud.
Drive digital growth and customer engagement by breaking down data silos and adding value to customer interactions.
Fuel your self-services analytics with the right data to develop unique business insights.
Innovate for the future while successfully navigating the complex web of regulations.
Transform decision making in the public sector with secure Data Intelligence that is FedRAMP Authorized.
Cloud ready data
Government and public sector
Tap into our knowledge base by connecting, sharing and learning from your peers in our Data Citizens community.
See how Collibra is helping global organizations unlock the value of their data.
Find the resources you need to accelerate time to value and fuel your growth.
Learn from the leaders in Data Intelligence through our individual courses, learning paths, and certification programs.
Data Citizens '20
Take your data strategy to the next level by arming yourself with the knowledge you need to achieve Data Intelligence.
Get advice, tips and tricks from our product experts and industry thought leaders to learn how to make your data meaningful.
Join the world’s largest virtual gathering of professionals focused on empowering businesses to deliver on strategic goals through Data Intelligence.
Check our upcoming events calendar to discover exciting opportunities to learn from our product and industry experts.
Connect the right data, insights, algorithms and people to optimize processes, increase efficiency and drive innovation.
Read our latest announcements, news coverage and thought leadership articles.
Find an opportunity to challenge and be challenged, and work with some of the most talented people in the business.
Get in touch with a member of our global team by locating an office near you, calling us or sending an email.
I have been a proud, card-carrying member of the Hunter-Gatherer Society for over three decades. By the Hunter-Gatherer Society, I mean all the processes that data analysts and data scientists spend the majority of their time on: gathering up and preparing data to use for the analytic models and reporting they are responsible for delivering to the business.
Over the course of my career, I faced different challenges than exist today. While there was a limited amount of data, all of the applications were internally-developed. The documentation was limited and out-of-date. The semantics and meaning of the data were limited to a few that had tribal knowledge. But these few were not incented to share their knowledge. I could not use the data where it existed in the application, so I had to extract it manually and integrate it in a different data structure. The gathering, integration, standardization, and cleansing of the data was hand coded, and generally took 15 to 18 days per month. That left me with two to five days per month to actually conduct the analytics and make recommendations. I often spent 90-95% of the month gathering the data and only 5-10% conducting the analytics and analysis to make recommendations.
Data warehousing architecture, techniques, and technologies have improved the gathering portion of the challenges. However, we still face challenges with finding the right data given the increasing volume and variety. I recently read articles that suggest today’s data scientists spend an average of 75% of their time finding the right data, validating the quality and controls applied, integrating with other data sets, and then preparing it for analytics. Thus, all our efforts with data warehousing may not have made the significant improvements in the challenges of the society.
Data scientists today face greater challenges than what I faced decades ago. What are the challenges that they must address? They need to ask the questions such as:
We are addressing these challenges with the Collibra Catalog. The data catalog is a new approach that many data scientists and analysts look forward to leveraging. I know I am. And while the data catalog certainly benefits all data citizens across the organization, let’s take a closer look at how it helps data scientists specifically.
A data catalog creates a better method for analytics professionals to easily find and determine the value of trusted data. It is a single source that gives analysts a view of the data that the organization maintains. It contains the metadata such as definitions of the data objects such as tables, columns, synonyms, value ranges, indexes, consumer groups, and accountable parties. And the data catalog provides a process to view the physical data environment that has a linkage to the business glossary. Thus it has the capability of linking the technical metadata with the business metadata to answer all of the previous questions.
The data catalog is an effective technology that reduces the duration of time data scientists need to conduct the “hunting and gathering” activities. Knowing the quality, controls, lineage, and authoritative source of data will also reduce the duration of time data scientists need for standardization, cleansing, and scrubbing the data prior to integration. Understanding the business definitions and usage constraints will additionally reduce the duration of time required.
We may never make the self-service analytics environment completely automatic, as the use cases are broad and complex. Yet, the data catalog is a significant tool for the data scientist and should reduce the percentage of time necessary to conduct the Hunter-Gatherer activities. That will provide the data scientist with additional time to actually conduct the analytics activities that the organization hired them to perform. And that should lead to greater business value and opportunities from analytics. Let’s all leverage the Collibra Catalog and stick a spear in the Hunter-Gatherer Society. We need to improve the productivity of the data scientist while improving the trust in our analytics.
We accelerate trusted business outcomes by connecting the right data, insights, algorithms and people for all Data Citizens.
© 2020 Collibra. All Rights Reserved.
A message to our Collibra community on COVID-19. Read more from our CEO.