Abstract

Twitter is a social network with over 250 million active users who collectively generate more than 500 million tweets each day. In social sciences research, Twitter has earned the focus of extensive research largely due to its openness in sharing its public data. Twitter exposes an extensive application program-ming interfaces (APIs) that can be used to collect a wealth of social data. In this chapter, we introduce these APIs and discuss how they can be used to conduct social sciences research. We also outline some issues that arise when using these APIs, and some strategies for collecting datasets that can give insight into a particular event. Introduction Twitter is a rich data source that provides several forms of information generated through the interaction of its users. These data can be harnessed to accomplish a variety of personalization and prediction tasks. Recently, Twitter data have been used to predict things as diverse as election results (Tumasjan et al., 2010; c.f. Chapter 2) or the location of earthquakes (Sakaki et al., 2010; c.f. Chapter 6). Twitter currently has over 250 million active users who collectively generate more than 500 million tweets each day. This creates a unique opportunity to conduct large-scale studies on user behavior. An important step before conducting such studies is the identification and collection of data relevant to the problem. Twitter is an online social networking platform where the registered users can create connections and share messages with other users. Messaging on Twitter is unique, as messages are required to be at most 140 characters long, and these messages are normally broadcast to all the users on Twitter. Thus, the platform provides an avenue to share content with a large and diverse population with few resources. These interactions generate different kinds of information. Information is made accessible to the public via APIs or interfaces where requests for data can be submitted. In this chapter, we introduce different forms of Twitter data and illustrate the capabilities and restrictions imposed by the API on Twitter data analysis.

Original languageEnglish (US)
Title of host publicationTwitter: A Digital Socioscope
PublisherCambridge University Press
Pages21-51
Number of pages31
ISBN (Print)9781316182635, 9781107102378
DOIs
StatePublished - Jan 1 2015

Fingerprint

Interfaces (computer)
Application programs
Social sciences
Earthquakes

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Kumar, S., Morstatter, F., & Liu, H. (2015). Analyzing twitter data. In Twitter: A Digital Socioscope (pp. 21-51). Cambridge University Press. https://doi.org/10.1017/CBO9781316182635.001

Analyzing twitter data. / Kumar, Shamanth; Morstatter, Fred; Liu, Huan.

Twitter: A Digital Socioscope. Cambridge University Press, 2015. p. 21-51.

Research output: Chapter in Book/Report/Conference proceedingChapter

Kumar, S, Morstatter, F & Liu, H 2015, Analyzing twitter data. in Twitter: A Digital Socioscope. Cambridge University Press, pp. 21-51. https://doi.org/10.1017/CBO9781316182635.001
Kumar S, Morstatter F, Liu H. Analyzing twitter data. In Twitter: A Digital Socioscope. Cambridge University Press. 2015. p. 21-51 https://doi.org/10.1017/CBO9781316182635.001
Kumar, Shamanth ; Morstatter, Fred ; Liu, Huan. / Analyzing twitter data. Twitter: A Digital Socioscope. Cambridge University Press, 2015. pp. 21-51
@inbook{e84fc7bc102f48aeab9a5d5fc51c6f9d,
title = "Analyzing twitter data",
abstract = "Twitter is a social network with over 250 million active users who collectively generate more than 500 million tweets each day. In social sciences research, Twitter has earned the focus of extensive research largely due to its openness in sharing its public data. Twitter exposes an extensive application program-ming interfaces (APIs) that can be used to collect a wealth of social data. In this chapter, we introduce these APIs and discuss how they can be used to conduct social sciences research. We also outline some issues that arise when using these APIs, and some strategies for collecting datasets that can give insight into a particular event. Introduction Twitter is a rich data source that provides several forms of information generated through the interaction of its users. These data can be harnessed to accomplish a variety of personalization and prediction tasks. Recently, Twitter data have been used to predict things as diverse as election results (Tumasjan et al., 2010; c.f. Chapter 2) or the location of earthquakes (Sakaki et al., 2010; c.f. Chapter 6). Twitter currently has over 250 million active users who collectively generate more than 500 million tweets each day. This creates a unique opportunity to conduct large-scale studies on user behavior. An important step before conducting such studies is the identification and collection of data relevant to the problem. Twitter is an online social networking platform where the registered users can create connections and share messages with other users. Messaging on Twitter is unique, as messages are required to be at most 140 characters long, and these messages are normally broadcast to all the users on Twitter. Thus, the platform provides an avenue to share content with a large and diverse population with few resources. These interactions generate different kinds of information. Information is made accessible to the public via APIs or interfaces where requests for data can be submitted. In this chapter, we introduce different forms of Twitter data and illustrate the capabilities and restrictions imposed by the API on Twitter data analysis.",
author = "Shamanth Kumar and Fred Morstatter and Huan Liu",
year = "2015",
month = "1",
day = "1",
doi = "10.1017/CBO9781316182635.001",
language = "English (US)",
isbn = "9781316182635",
pages = "21--51",
booktitle = "Twitter: A Digital Socioscope",
publisher = "Cambridge University Press",

}

TY - CHAP

T1 - Analyzing twitter data

AU - Kumar, Shamanth

AU - Morstatter, Fred

AU - Liu, Huan

PY - 2015/1/1

Y1 - 2015/1/1

N2 - Twitter is a social network with over 250 million active users who collectively generate more than 500 million tweets each day. In social sciences research, Twitter has earned the focus of extensive research largely due to its openness in sharing its public data. Twitter exposes an extensive application program-ming interfaces (APIs) that can be used to collect a wealth of social data. In this chapter, we introduce these APIs and discuss how they can be used to conduct social sciences research. We also outline some issues that arise when using these APIs, and some strategies for collecting datasets that can give insight into a particular event. Introduction Twitter is a rich data source that provides several forms of information generated through the interaction of its users. These data can be harnessed to accomplish a variety of personalization and prediction tasks. Recently, Twitter data have been used to predict things as diverse as election results (Tumasjan et al., 2010; c.f. Chapter 2) or the location of earthquakes (Sakaki et al., 2010; c.f. Chapter 6). Twitter currently has over 250 million active users who collectively generate more than 500 million tweets each day. This creates a unique opportunity to conduct large-scale studies on user behavior. An important step before conducting such studies is the identification and collection of data relevant to the problem. Twitter is an online social networking platform where the registered users can create connections and share messages with other users. Messaging on Twitter is unique, as messages are required to be at most 140 characters long, and these messages are normally broadcast to all the users on Twitter. Thus, the platform provides an avenue to share content with a large and diverse population with few resources. These interactions generate different kinds of information. Information is made accessible to the public via APIs or interfaces where requests for data can be submitted. In this chapter, we introduce different forms of Twitter data and illustrate the capabilities and restrictions imposed by the API on Twitter data analysis.

AB - Twitter is a social network with over 250 million active users who collectively generate more than 500 million tweets each day. In social sciences research, Twitter has earned the focus of extensive research largely due to its openness in sharing its public data. Twitter exposes an extensive application program-ming interfaces (APIs) that can be used to collect a wealth of social data. In this chapter, we introduce these APIs and discuss how they can be used to conduct social sciences research. We also outline some issues that arise when using these APIs, and some strategies for collecting datasets that can give insight into a particular event. Introduction Twitter is a rich data source that provides several forms of information generated through the interaction of its users. These data can be harnessed to accomplish a variety of personalization and prediction tasks. Recently, Twitter data have been used to predict things as diverse as election results (Tumasjan et al., 2010; c.f. Chapter 2) or the location of earthquakes (Sakaki et al., 2010; c.f. Chapter 6). Twitter currently has over 250 million active users who collectively generate more than 500 million tweets each day. This creates a unique opportunity to conduct large-scale studies on user behavior. An important step before conducting such studies is the identification and collection of data relevant to the problem. Twitter is an online social networking platform where the registered users can create connections and share messages with other users. Messaging on Twitter is unique, as messages are required to be at most 140 characters long, and these messages are normally broadcast to all the users on Twitter. Thus, the platform provides an avenue to share content with a large and diverse population with few resources. These interactions generate different kinds of information. Information is made accessible to the public via APIs or interfaces where requests for data can be submitted. In this chapter, we introduce different forms of Twitter data and illustrate the capabilities and restrictions imposed by the API on Twitter data analysis.

UR - http://www.scopus.com/inward/record.url?scp=84954229346&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84954229346&partnerID=8YFLogxK

U2 - 10.1017/CBO9781316182635.001

DO - 10.1017/CBO9781316182635.001

M3 - Chapter

SN - 9781316182635

SN - 9781107102378

SP - 21

EP - 51

BT - Twitter: A Digital Socioscope

PB - Cambridge University Press

ER -