Remote Datasets

cdlib can retrieve existing network datasets, along with their ground truth partitions (if available), from an ad-hoc remote repository.


The following features are still under testing and are therefore accessible only in the GitHub version of the library.

available_networks() List the remotely available network datasets.
available_ground_truths() List the remotely available network ground truth datasets.
fetch_network_data([net_name, net_type]) Load the required network from the remote repository.
fetch_ground_truth_data([net_name, graph]) Load the required ground truth clustering from the remote repository.
fetch_network_ground_truth([net_name, net_type]) Load the required network, along with its ground truth partition, from the remote repository.
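The functions above can be combined into a small helper that checks what the repository offers before downloading anything. The following is a minimal sketch, not part of cdlib: the helper name `fetch_with_optional_truth` is hypothetical, and it assumes that `available_networks()` and `available_ground_truths()` return lists of dataset names, that `"karate_club"` is one of them, and that `net_type` accepts `"igraph"` or `"networkx"`. The module is passed in as an argument so the helper itself has no import-time dependency.

```python
def fetch_with_optional_truth(datasets, net_name="karate_club", net_type="igraph"):
    """Fetch a remote network and, when one is listed, its ground truth.

    `datasets` is expected to be the `cdlib.datasets` module (or any object
    exposing the same five functions). Hypothetical helper, not a cdlib API.
    """
    # Assumption: available_networks() returns an iterable of dataset names.
    if net_name not in datasets.available_networks():
        raise ValueError(f"network {net_name!r} is not in the remote repository")

    graph = datasets.fetch_network_data(net_name=net_name, net_type=net_type)

    # Ground truth partitions exist only for some datasets, so check first.
    if net_name in datasets.available_ground_truths():
        truth = datasets.fetch_ground_truth_data(net_name=net_name, graph=graph)
    else:
        truth = None

    return graph, truth


# Usage (requires the GitHub version of cdlib and network access):
#   from cdlib import datasets
#   graph, truth = fetch_with_optional_truth(datasets, "karate_club", "networkx")
```

When a dataset is known to ship with a ground truth partition, `fetch_network_ground_truth` retrieves both objects in a single call, so the helper is mainly useful when that is uncertain.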