Datasets

Aviva insurance tweets

A small dataset of 399 tweets about Aviva insurance. The CSV file covers tweets posted between 26/06/2014 and 27/06/2014.

  1. AVIVA tweets [115KB]
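
A minimal loading sketch in Python; the file name and the "text" column used here are assumptions, so check the header row first:

```python
# Minimal sketch for loading the tweets CSV with the standard library.
# The file name and the "text" column are assumptions; check the header row.
import csv

with open("aviva_tweets.csv", newline="", encoding="utf-8") as f:
    tweets = list(csv.DictReader(f))

print(len(tweets))            # expected: 399
print(tweets[0].get("text"))  # hypothetical column name
```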

Million graph data

The enriched graph used to evaluate the graph join operator. The vertices have been enriched following the LDBC Social Network Benchmark protocol. The TXT files contain the vertices and edges in comma- and tab-separated formats.

  1. Enriched Graph Data [344MB]
  2. LiveJournal edges
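
A minimal reading sketch, assuming comma-separated vertex records and tab-separated edge pairs; the file names and field order are guesses, not the dataset's documented schema:

```python
# Minimal sketch for reading the graph TXT files.
# Assumes comma-separated vertex records and tab-separated edge pairs;
# file names and field order are assumptions, not the dataset's schema.

def load_vertices(path):
    with open(path, encoding="utf-8") as f:
        # one vertex per line: id followed by its LDBC-style enrichment fields
        return [line.rstrip("\n").split(",") for line in f]

def load_edges(path):
    with open(path, encoding="utf-8") as f:
        # one edge per line: source id <TAB> target id
        return [tuple(line.rstrip("\n").split("\t")[:2]) for line in f]

vertices = load_vertices("vertices.txt")  # hypothetical file name
edges = load_edges("edges.txt")           # hypothetical file name
```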

We also provide the datasets used to benchmark the graph nesting operator: both the gMark-generated operands and the authorship subgraphs extracted from the Microsoft Academic Graph.

  1. gMark-generated operands [2GB]
  2. Microsoft Academic Authorship graph [5GB]

Public figures Facebook posts

The Facebook posts of six public figures from different social categories (i.e. politicians, journalists, and singers). Each JSON file contains one thousand posts.

  1. Amanpour [205KB]
  2. Macklemore [214KB]
  3. Obama [136KB]
  4. Renzi [569KB]
  5. Travaglio [2MB]
  6. Vasco [410KB]
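
A minimal parsing sketch, assuming each file is a JSON array of post objects; the field names follow the Facebook Graph API convention and are an assumption about this dataset's actual schema:

```python
# Minimal sketch for reading one of the public-figure JSON files.
# Field names ("created_time", "message") follow the Facebook Graph API
# convention and are assumptions about this dataset's actual schema.
import json

with open("Obama.json", encoding="utf-8") as f:
    posts = json.load(f)  # assumed: a JSON array of one thousand post objects

for post in posts[:3]:
    print(post.get("created_time"), (post.get("message") or "")[:80])
```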

PubMed abstracts for biomedical clustering evaluation

The evaluation dataset is built from PubMed abstracts. Each entry groups the articles related to a single disease.

  1. Dataset creation procedure.
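
For context, one measure such an evaluation dataset supports is cluster purity against the gold disease labels; a minimal sketch with hypothetical data structures, not the dataset's actual format:

```python
# Minimal sketch of one evaluation this dataset supports: cluster purity
# against the gold disease labels. The data structures are assumptions.
from collections import Counter

def purity(clusters, gold):
    """clusters: lists of article ids; gold: article id -> disease label."""
    majority = sum(
        Counter(gold[a] for a in cluster).most_common(1)[0][1]
        for cluster in clusters if cluster
    )
    return majority / sum(len(c) for c in clusters)

clusters = [["a1", "a2"], ["a3"]]                       # hypothetical ids
gold = {"a1": "asthma", "a2": "asthma", "a3": "lupus"}  # hypothetical labels
print(purity(clusters, gold))  # 1.0
```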

Smartphone images

The dataset includes photos taken with 19 different smartphones, from both the front and the rear camera. For each smartphone, a subset of 100 images (50 from the front camera and 50 from the rear one) was uploaded to and downloaded from the following social media platforms: Facebook, Flickr, Google+, GPhoto, Instagram, LinkedIn, Pinterest, QQ, Telegram, Tumblr, Twitter, Viber, VK, WeChat, WhatsApp, and WordPress. The Readme.csv file summarizes the smartphones' characteristics.

  1. Images [53GB]
  2. Readme
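
For scale, a back-of-the-envelope count of the platform-processed copies implied by the description above, assuming every (smartphone, platform) pair kept the full 100-image subset:

```python
# Back-of-the-envelope size of the shared subset described above, assuming
# every (smartphone, platform) pair kept the full 100-image subset.
smartphones = 19
platforms = 16    # Facebook through WordPress, as listed above
per_phone = 100   # 50 front-camera + 50 rear-camera images

print(smartphones * platforms * per_phone)  # 30400 platform-processed copies
```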

Smartphone videos

The dataset includes videos taken with 13 different smartphones, from both the front and the rear camera. The Readme.csv file summarizes the smartphones' characteristics.

  1. Videos [80GB]
  2. Readme

Time in Text

This dataset comprises the time intervals extracted and normalized from the temporal expressions (timexes) found in two large text corpora.

  1. Wikipedia - 89 Million timexes [8GB]
  2. New York Times 1987-2007 - 15 Million timexes [2GB]
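
A minimal sketch of what a normalized timex record might look like; the layout is a hypothetical illustration, not the released files' documented schema:

```python
# Minimal sketch of a normalized timex record; the layout is an assumption
# about the released files, not their documented schema.
from dataclasses import dataclass
from datetime import date

@dataclass
class TimeInterval:
    surface: str  # the temporal expression as it appears in the text
    begin: date   # normalized interval start
    end: date     # normalized interval end

# e.g. "the summer of 1999" might normalize to an interval like this:
print(TimeInterval("the summer of 1999", date(1999, 6, 21), date(1999, 9, 22)))
```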

Text Watermarking evaluation