DataLad

What is DataLad?
DataLad is a Python-based distributed data management system that keeps track of your data with version control, creates structure, ensures reproducibility, supports collaboration, and integrates with widely used data infrastructure. It is developed and maintained as a free and open source project by a global and interdisciplinary community of scientists.
DataLad's purpose is to make data management and data distribution more accessible. To do so, it stands on the shoulders of Git and Git-annex to deliver a decentralized system for data exchange. This includes automated ingestion of data from online portals and exposing it in readily usable form as Git(-annex) repositories - or datasets. However, the actual data storage and permission management remains with the original data provider(s).
DataLad is not just a single software package. Numerous extension packages can equip the base package with additional functionality, or even tailor and tune the way the base package works. Beyond being used by individual researchers, the software is a foundational element in various national and international consortia as a front- or back-end.
The primary goal of the DataLad project is to support the collaborative process of distilling knowledge from data according to the FAIR Guiding Principles — Findability, Accessibility, Interoperability, and Reusability.
Visit DataLad to find out more.