Aviva insurance tweets
A small dataset of 399 tweets about AVIVA insurance. The CSV file includes tweets from 26/06/2014 to 27/06/2014.
- AVIVA tweets [115KB]
Million graph data
The enriched graph used for the join operator. The vertices have been enriched following the LDBC Social Network Benchmark protocol. The TXT files contain vertices and edges in a comma- and tab-separated format.
- Enriched Graph Data [344MB]
- LiveJournal edges
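As a rough illustration of reading a comma- and tab-separated edge file, the sketch below splits each line on either delimiter. The exact column layout of the TXT files is not described here, so this assumes the first two fields of each edge line are the source and target vertex identifiers. The function names `parse_delimited_line` and `read_edges` are hypothetical, and the sample data is made up.

```python
import io
import re


def parse_delimited_line(line):
    """Split one record on tabs or commas and trim surrounding whitespace."""
    return [field.strip() for field in re.split(r"[\t,]", line.rstrip("\n"))]


def read_edges(handle):
    """Yield (source, target) pairs from an open text file.

    Assumption: the first two fields of each non-blank line are the
    source and target vertex ids; any further fields are ignored.
    """
    for line in handle:
        if not line.strip():
            continue  # skip blank lines
        fields = parse_delimited_line(line)
        if len(fields) < 2:
            raise ValueError(f"expected at least two fields, got: {line!r}")
        yield fields[0], fields[1]


# Made-up sample mixing both delimiters, standing in for a real edge file.
sample = io.StringIO("1\t2\n2\t3\n\n3,1\n")
edges = list(read_edges(sample))
```

For a real file, the `io.StringIO` object would be replaced by `open(path, encoding="utf-8")`. For a file of this size, streaming line by line as above avoids loading everything into memory at once.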
We also provide the dataset that we used to benchmark the graph nesting operator. In particular, we offer both the gMark-generated subgraphs and the authorship subgraphs of the Microsoft Academic Graph.
Public figures Facebook posts
The dataset includes photos taken by 19 different smartphones, using both the front and the rear cameras. For each smartphone, a subset of 100 images (50 from the front camera and 50 from the rear one) was uploaded to and downloaded from the following social media platforms: Facebook, Flickr, Google+, GPhoto, Instagram, LinkedIn, Pinterest, QQ, Telegram, Tumblr, Twitter, Viber, VK, WeChat, WhatsApp and WordPress. The Readme.csv file summarizes the smartphones' characteristics.
Time in Text
This dataset comprises the time intervals extracted and normalized from the temporal expressions found in text corpora.
Text Watermarking evaluation
The following datasets were used to evaluate the robustness of our Fine-grain text watermarking method.