Designing, operating and managing a Multi-purpose Data Lake
Start Date: 2021-05-24
End Date: 2021-05-25
Time: 9:00 am to 5:30 pm
Most organizations today are dealing with multiple silos of information. These include Cloud and on-premises based transaction processing systems, multiple Data Warehouses, Data Marts, Reference Data management (RDM) systems, Master Data Management (MDM) systems, Content Management (ECM) systems and more recently Big Data NoSQL platforms such as Hadoop and other NoSQL databases.
In addition the number of data sources is increasing dramatically especially from outside the enterprise. Given this situation it is not surprising that many companies have ended up managing information in silos with different tools being used to prepare and manage data across these systems with varying degrees of governance.
In addition, it is not only IT that is now integrating data. Business users are also getting involved with new self-service data preparation tools. The question is, is this the only way to manage data? Is there another level that we can get reach to allow us to more easily manage and govern data across an increasingly complex data landscape?
This seminar looks at the challenges faced by companies trying to deal with an exploding number of data sources, collecting data in multiple data stores (Cloud and on-premises), multiple analytical systems and at the requirements to be able to define, govern, manage and share trusted high quality information in a distributed and hybrid computing environment.
It also explores a new approach of how IT data architects, business users and IT developers can collaborate together in building and managing a Logical Data Lake to get control of your data. This includes data ingestion, automated data discovery, data profiling and tagging and publishing data in an information catalog.
It also involves refining raw data to produce Enterprise Data Services that can be published in a catalog available for consumption across your company. We also introduce multiple Data Lake configurations including a centralized Data Lake and a ‘logical’ distributed Data Lake as well as execution of jobs and governance across multiple data stores.
It emphatizes the need for a common collaborative process and common approach to governing and managing data of all types.
Click the link for registration or ticketing information:
Organized by : Technology Transfer