Data gathers metadata about software from 33 packaage managers and three repository hosts — GitHub, GitLab and BitBucket.

The data we collect powers search on and other sites (if you’d like us to do that for you get in touch, helps us recommend projects, warn users of problems with their dependencies and using SourceRank.

Downloading Libraries’ Data

Data gathered by is avaialble for download on Zenodo. Documentaiton on what’s included can be found at

Acccessing Libraries’ Data Programatically

Libraries’API provides up-to-date information for a project, repository or user. Data will soon be avaialble via Google BigQuery as apart of their public datasets programme.


All data gathered by is provided under a Creative Commons, Attribution Sharealike 4.0 Licence. You can read the full wording of the licence here.


Our strategy is to gather and share information about open source software to create a stronger ecosystem and help developers make more informated decisions about the software they use. If there’s some data that does not currently collect that you would like to use in your research, application or service please create a ticket for it.