Datasets

Gowalla Dataset

This dataset was collected from Gowalla, a popular location-based social network, which has more than 600,000 users since November 2010 and was acquired by Facebook in December 2011. In practice, we used the Gowalla APIs to collect the user profiles, user friendship, location profiles, and users’ check-in history made before June 1, 2011. Finally, we have obtained 36,001,959 check-ins made by 319,063 users over 2,844,076 locations. The locations in Gowalla are grouped into 7 main categories, i.e., Community, Entertainment, Food, Nightlife, Outdoors, Shopping and Travel, and each main category consists of several subcategories.

You can download this dataset from here (about 350MB). This dataset is released solely for research purpose. Please cite at least one of the following 2 papers if you use this dataset in your research.

  • Yong Liu, Wei Wei, Aixin Sun, Chunyan Miao, “Exploiting Geographical Neighborhood Characteristics for Location Recommendation”, In Proceedings of the 23rd ACM International Conference on Information and Knowledge Management (CIKM’14), pp. 739-748. ACM, 2014.

  • Xin Liu, Yong Liu, Karl Aberer, Chunyan Miao, “Personalized Point-of-Interest Recommendation by Mining Users’ Preference Transition”, In Proceedings of the 22nd ACM International Conference on Information and Knowledge Management (CIKM’13), pp. 733-738. ACM, 2013.

Weeplaces Dataset

This dataset is collected from Weeplaces, a website that aims to visualize users’ check-in activities in location-based social networks (LBSN). It is now integrated with the APIs of other location-based social networking services, e.g., Facebook Places, Foursquare, and Gowalla. Users can login Weeplaces using their LBSN accounts and connect with their friends in the same LBSN who have also used this application. All the crawled data is originally generated in Foursquare. This dataset contains 7,658,368 check-ins generated by 15,799 users over 971,309 locations. In the data collection, we can’t get the original Foursquare IDs of the Weeplaces users. We can only get their check-in history, their friends who also use Weeplaces, and other additional information about the locations.

You can download this dataset from here (about 140 MB). Note that this dataset is released solely for research purpose.