A new Web service searches the change logs of 35 million Wikipedia edits and attempts to identify the organization associated with the IP addresses recorded with changes.
A Cal Tech grad student has developed search software that may help bring more accountability to Wikipedia, the online encyclopedia that's open to editing by just about anyone.
Virgil Griffith has created WikiScanner, a Web service that searches the change logs of 35 million Wikipedia edits and attempts to identify the organization associated with the IP addresses recorded with changes.
Wikipedia has long been the target of editing for the sake of image management, not to mention disparagement. Last year, for example, U.S. Senate staff members, using IP addresses associated with the U.S. Senate, altered Wikipedia articles to remove information that reflected poorly on various senators.
In addition to a generic submission form that accepts queries by organization, IP address, or Wikipedia page, WikiScanner provides a list of links that execute pre-set queries for high-profile companies that are or have been embroiled in political or social controversies. These companies include Amgen, Diebold, ExxonMobil, Pfizer, and Wal-Mart. The site also includes links to query about media organizations like Al-Jazeera, The New York Times, and Fox News.
The Fox News link reveals, for example, that Oct. 11, 2005, someone at an IP address associated with Fox News (18.104.22.168) edited Fox News anchor Shepard Smith's Wikipedia entry to remove a paragraph about Smith's 2000 arrest "for aggravated battery with a motor vehicle." At some later point, that information was restored.
Wired News maintains an ongoing list of "the Most Shameful Wikipedia Spin Jobs," which includes efforts by numerous companies to improve their public image by removing unflattering information.
While WikiScanner falls short of identifying those altering Wikipedia articles, Griffith believes identifying the network of origin for Wikipedia edits is still useful. "Technically, we don't know whether it came from an agent of that company, however, we do know that edit came from someone with access to their network," he explains on his Web site. "If the edit occurred during working hours, then we can reasonably assume that the person was an agent of that company or was a guest that was allowed access to their network."
Earlier this year, Wikipedia founder Jimmy Wales suggested adopting a scheme to verify credentials claimed by Wikipedia contributors following revelations about a prominent contributor with a fabricated background.
The Agile ArchiveWhen it comes to managing data, donít look at backup and archiving systems as burdens and cost centers. A well-designed archive can enhance data protection and restores, ease search and e-discovery efforts, and save money by intelligently moving data from expensive primary storage systems.
2014 Analytics, BI, and Information Management SurveyITís tried for years to simplify data analytics and business intelligence efforts. Have visual analysis tools and Hadoop and NoSQL databases helped? Respondents to our 2014 InformationWeek Analytics, Business Intelligence, and Information Management Survey have a mixed outlook.
Join us for a roundup of the top stories on InformationWeek.com for the week of December 14, 2014. Be here for the show and for the incredible Friday Afternoon Conversation that runs beside the program.