One in five tweets divulge their location

A new study from USC researchers sampled more than 15 million tweets, showing that some Twitter users may be inadvertently revealing their location through updates on the social media channel.
The study, which appears in the current issue of the International Journal of Geoinformatics, provides important factual data for a growing national conversation about online privacy and third-party commercial or government use of geo-tagged information.
"I'm a pretty private person, and I wish others would be more cautious with the types of information they share," said lead author Chris Weidemann, a graduate student in the Geographic Information Science and Technology (GIST) online master's program at the USC Dornsife College of Letters, Arts and Sciences. "There are all sorts of information that can be gleaned from things outside of the tweet itself."
Twitter has approximately 500 million active users who are expected to tweet 72 billion times in 2013. Reports have shown that about six percent of users opt in to allow the platform to broadcast their location with every tweet.
But that's only part of the footprint Twitter users leave, and even users who have not opted in to location tagging may be inadvertently revealing where they are, the study shows.
To get a fuller sense of what publicly accessible data might reveal about Twitter users, Weidemann developed an application called Twitter2GIS to analyze the metadata collected by Twitter, including details about the user's hometown, time zone and language.
The data, generated by Twitter users and available through Twitter's application programming interface (API) and Google's Geocoding API, was then processed by a software program, which mapped and analyzed the data, searching for trends.
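As a rough illustration of the kind of sorting such a program might perform, the sketch below groups tweet records by the type of location clue they carry. It is not the Twitter2GIS code: the record layout and the field names (`coordinates`, `profile_location`, `time_zone`) are hypothetical stand-ins for the metadata described above, and no geocoding step is shown.

```python
from collections import Counter


def location_signal(tweet):
    """Classify one tweet record by the location clue it carries.

    Field names are hypothetical; a real tweet record may differ.
    """
    # Coordinates attached to the tweet itself are the most direct clue.
    if tweet.get("coordinates") is not None:
        return "explicit"
    # Profile metadata (hometown, time zone) is a weaker, indirect clue.
    if tweet.get("profile_location") or tweet.get("time_zone"):
        return "metadata"
    return "none"


def summarize(tweets):
    """Count how many tweets fall into each location-signal group."""
    return Counter(location_signal(t) for t in tweets)


# Made-up sample records, for illustration only.
sample = [
    {"text": "At the conference", "coordinates": (34.0224, -118.2851)},
    {"text": "Good morning", "coordinates": None, "profile_location": "Los Angeles, CA"},
    {"text": "Lunch time", "coordinates": None, "time_zone": "Pacific Time (US & Canada)"},
    {"text": "Hello", "coordinates": None},
]

counts = summarize(sample)
for kind in ("explicit", "metadata", "none"):
    print(kind, counts[kind])
```

A real analysis would go further than this grouping, for example by passing the profile text to a geocoding service and mapping the results, which is the step the article attributes to Google's Geocoding API.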
During the study's one-week sampling period, roughly 20 percent of the tweets collected showed the user's location to an accuracy of street level or better.
Many Twitter users divulged their physical location directly through active location monitoring or GPS coordinates. But another 2.2 percent of all tweets -- equating to about 4.4 million tweets a day -- provided so-called "ambient" location data, where the user might not be aware they are divulging their location.
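The daily figure can be checked against the numbers quoted earlier in the article. The short calculation below assumes the 72 billion tweets projected for 2013 are spread evenly over 365 days, an assumption the article does not state.

```python
# Figures quoted in the article.
tweets_per_year = 72_000_000_000   # projected tweets in 2013
ambient_share = 0.022              # 2.2 percent of all tweets

# Assumption: tweets are spread evenly across the year.
tweets_per_day = tweets_per_year / 365
ambient_per_day = ambient_share * tweets_per_day

# Daily volume implied by the article's "about 4.4 million" figure.
implied_daily_volume = 4_400_000 / ambient_share

print(f"tweets per day: {tweets_per_day:,.0f}")
print(f"ambient tweets per day: {ambient_per_day:,.0f}")
print(f"daily volume implied by 4.4 million: {implied_daily_volume:,.0f}")
```

Under that assumption the result is roughly 4.3 million tweets a day, slightly below the quoted figure. The article's 4.4 million matches a round 200 million tweets a day, which suggests the daily volume was rounded before the percentage was applied.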
"The downside is that mining this kind of information can also provide opportunities for criminal misuse of data," Weidemann said. "My intent is to educate social media users and inform the public about their privacy."
In addition to being a graduate student at USC, Weidemann works for a company that builds geographic information systems for the federal government. He initially developed Twitter2GIS as part of a capstone project for a course taught by Jennifer Swift, associate teaching professor of spatial sciences at USC.
Swift, Weidemann's thesis adviser, said the project stood out for its thoughtful look at geospatial information.
"It will help create an awareness among the general population about the information they divulge," said Swift, a co-author on the study.
Weidemann is a self-described "conservative" Twitter user who uses the social media channel infrequently. He has his privacy settings configured so that no location information is shared with his tweets. Still, in the course of doing this study, he turned Twitter2GIS on his own account and was surprised at how specifically the application was able to pinpoint his location, based on a hashtag he used about an academic conference.
"This research has been fun," Weidemann said. "And a little scary."
Weidemann is opening up Twitter2GIS to the public, expanding it to allow Twitter users to log in with their profile credentials so they can view their own location footprint. Test out the beta version and provide feedback at http://geosocialfootprint.com.
http://www.sciencedaily.com/releases/2013/09/130903194151.htm
http://mashable.com/2010/03/10/twitter-geolocation-tweets/
Every Tweet you've ever posted will be available to anyone:
Topsy is backfilling its social search engine to include every public tweet ever published on Twitter, the company will announce Wednesday. The inventory addition means the company's analytics tools can now go all the way back in time to Twitter's first tweet, penned by co-founder Jack Dorsey on March 21, 2006.
Founded in 2007, Topsy, an index of the social Web, started out as a social search company; it has since become adept at sentiment analysis and has outlasted a number of now-defunct social search startups. Along the way, Topsy has become besties with Twitter, powering the information network's indices for the 2012 presidential race and the 2013 Oscars.
The release positions the company as a top destination for deep dives into social data. To drive home the point, Topsy said it now houses more than 425 billion tweets, videos, images, and blog posts, which adds up to more social data than Bing or Google.
"By adding a full historical index, now we can look even further back to the very first tweets 7 years ago, meaning our users have access to the best, most accurate view of the world's social conversation," Topsy co-founder and CTO Vipul Ved Prakash said in a statement.
http://news.cnet.com/8301-1023_3-57601209-93/topsy-indexes-entire-tweet-history-for-search-analysis/