DatAasee -- A Metadata-Lake as Metadata Catalog for a Virtual Data-Lake
It addresses the problem of metadata management for distributed data sources in research and library contexts, but the contribution is incremental.
The paper proposes a metadata-lake architecture to manage metadata from distributed data sources in a research-data and library-oriented setting, and presents a proof-of-concept implementation.
Metadata management for distributed data sources is a long-standing but ever-growing problem. To counter this challenge in a research-data and library-oriented setting, this work constructs a data architecture, derived from the data-lake: the metadata-lake. A proof-of-concept implementation of this proposed metadata aggregator is presented and briefly evaluated.