The resulting dataset is a CSV file containing 3748 tweets tagged with #HASTAC2014 (case not sensitive).
The first tweet in the dataset is dated 19/04/2014 23:10:50 Lima, Perú time and the last one is dated 27/04/2014 15:00:54 also Lima, Perú time. The file also contains equivalent times in GMT.
HASTAC is an alliance of humanists, artists, social scientists, scientists and technologists working together to transform the future of learning for the 21st century. Since 2002, HASTAC (“haystack”) has served as a community of connection where 11,500+ members share news, tools, research, insights, and projects to promote engaged learning for a global society.
HASTAC 2014: Hemispheric Pathways: Critical Makers in International Networks, the 6th international conference for the Humanities, Arts, Science, and Technology Alliance and Collaboratory, was hosted by the Ministerio Cultura of Lima, Perú, from 6pm Wednesday 23 April to 1pm Sunday 27 April 2014 local time. In order to avoid the inclusion of spam tweets the minimum number of followers a person had to have to be included in the archive was two.
I harvested the tweets with (several!) Twitter Archiving Google Spreadsheets (TAGS version 5.1, by Martin Hawksey).
Please note that both research and experience show that the Twitter search API isn’t 100% reliable. Large tweet volumes affect the search collection process as well. The API might “over-represent the more central users”, not offering “an accurate picture of peripheral activity” (González-Bailón, Sandra, et al. 2012). Therefore, it cannot be guaranteed this file contains each and every tweet tagged with #HASTAC2014 during the indicated period.
[It should go without saying but perhaps it must also be noted that some conference tweets might have used other variations of the hashtag. Logically those were not included in this collection. Therefore it cannot be said that even all tweets tagged #HASTAC2014 represent all the Twitter activity around the 2014 conference.]
The file includes raw data and it might require refining including deduplication. The data is shared as is.
The file is openly accessible via figshare:
Priego, Ernesto (2014): #HASTAC2014 Conference Tweets Archive from 19 April to 25 April 2014. figshare.
[I have just published this and the doi might take some time to become active].
The URL for the dataset is
The file is shared with a Creative Commons- Attribution license (CC-BY).
I have been archiving conference tweets and sharing backchannel datasets for some time now. I am keen on promoting the study of academic conference networks on Twitter. By openly sharing the resulting datasets and by blogging about it throughout time, I have also been openly documenting my own learning curve trying to archive tweets and how to do it better. If you use or refer to this data in any way please cite and link back using the citation information above.
I will hopefully have time to finish and publish another post with more detail about the HASTAC conference backchannels soon.
Thank you for reading and sharing. If you attended the conference, I hope you had a nice time. As usual, I am sorry I could not attend in person.