{"id":1613,"date":"2007-10-30T15:39:39","date_gmt":"2007-10-30T15:39:39","guid":{"rendered":"http:\/\/rinf.com\/alt-news\/surveillance-big-brother\/att-invents-programming-language-for-mass-surveillance\/1613\/"},"modified":"2007-10-30T15:41:35","modified_gmt":"2007-10-30T15:41:35","slug":"att-invents-programming-language-for-mass-surveillance","status":"publish","type":"post","link":"http:\/\/rinf.com\/alt-news\/surveillance-big-brother\/att-invents-programming-language-for-mass-surveillance\/","title":{"rendered":"The Programming Language for Mass Surveillance"},"content":{"rendered":"<p><span style=\"margin-right: 20px\"><span id=\"contributor\" class=\"c cs\">According to government documents <a href=\"http:\/\/www.nytimes.com\/2007\/09\/09\/washington\/09fbi.html\">studied by The New York Times<\/a>, the FBI asked several phone companies to analyze phone-call patterns of Americans using a technology called \u201ccommunities of interest\u201d. Verizon refused, saying that it didn\u2019t have any such technology. AT&amp;T, famously, did not refuse.<\/span><\/span><span style=\"margin-right: 20px\"><span id=\"contributor\" class=\"c cs\">What is the \u201ccommunities of interest\u201d technology? It\u2019s spelled out very clearly in a 2001 research paper from AT&amp;T itself, entitled \u201c<a href=\"http:\/\/citeseer.ist.psu.edu\/cortes01communities.html\">Communities of Interest<\/a>\u201d (by C. Cortes, D. Pregibon, and C. Volinsky). They use high-tech data-mining algorithms to scan through the huge daily logs of every call made on the AT&amp;T network; then they use sophisticated algorithms to analyze the connections between phone numbers: who is talking to whom? The paper literally uses the term \u201cGuilt by Association\u201d to describe what they\u2019re looking for: what phone numbers are in contact with other numbers that are in contact with the bad guys?<\/p>\n<p>When this research was done, back in the last century, the bad guys where people who wanted to rip off AT&amp;T by making fraudulent credit-card calls. (Remember, back in the last century, intercontinental long-distance voice communication actually cost money!) But it\u2019s easy to see how the FBI could use this to chase down anyone who talked to anyone who talked to a terrorist. Or even to a \u201cterrorist.\u201d<\/p>\n<p><strong>AT&amp;T Invents Surveillance Programming Language<\/strong>\u00a0\u00a0<\/p>\n<p><span style=\"margin-right: 20px\"><span class=\"c cs\">By Ryan Singel<\/span><\/span><\/p>\n<p><\/span><\/span><span style=\"margin-right: 20px\"><span class=\"c cs\"><\/span><\/span>From the company that brought you the C programming language comes Hancock, a C variant developed by AT&amp;T researchers to mine gigabytes of the company&#8217;s telephone and internet records for surveillance purposes.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"399\" src=\"http:\/\/blog.wired.com\/photos\/uncategorized\/2007\/10\/29\/guilt_by_association_3.jpg\" height=\"567\" style=\"width: 399px; height: 567px\" \/><\/p>\n<p>An <a href=\"http:\/\/citeseer.ist.psu.edu\/cortes01communities.html\">AT&amp;T research paper<\/a> published in 2001 and unearthed today by Andrew Appel at <a href=\"http:\/\/www.freedom-to-tinker.com\/?p=1219\">Freedom to Tinker<\/a> shows how the phone company uses Hancock-coded software to crunch through tens of millions of long distance phone records a night to draw up what AT&amp;T calls &#8220;communities of interest&#8221; &#8212; i.e., calling circles that show who is talking to whom.<\/p>\n<p>The system was built in the late 1990s to develop marketing leads, and as a security tool to see if new customers called the same numbers as previously cut-off fraudsters &#8212; something the paper refers to as &#8220;guilt by association.&#8221;<\/p>\n<p>But it&#8217;s of interest to THREAT LEVEL because of recent revelations that the FBI has been requesting &#8220;communities of interest&#8221; records from phone companies under the USA PATRIOT Act without a warrant. Where the bureau got the idea that phone companies collect such data has, until now, been a mystery.<\/p>\n<p>According to a letter from Verizon to a congressional committee earlier this month, the FBI has been asking Verizon for &#8220;community of interest&#8221; records on some of its customers out to two generations &#8212; i.e., not just the people that communicated with an FBI target, but also those who talked to people who talked to an FBI target. Verizon, though, doesn&#8217;t create those records and couldn&#8217;t comply. Now it appears that AT&amp;T invented the concept and the technology. It even owns <a href=\"http:\/\/www.google.com\/patents?id=_b0LAAAAEBAJ&amp;dq=6480844&amp;num=100\">a patent on some of its data mining methods<\/a>, issued to two of Hancock&#8217;s creators in 2002.<\/p>\n<p>Programs written in Hancock work by analyzing data as it flows into a data warehouse. That differentiates the language from traditional data-mining applications which tend to look for patterns in static databases. A 2004 paper published in <em>ACM Transactions on Programming Languages and Systems<\/em> shows how Hancock code can sift calling card records, long distance calls, IP addresses and internet traffic dumps, and even track the physical movements of mobile phone customers as their signal moves from cell site to cell site.\u00a0<\/p>\n<p>With Hancock, &#8220;analysts could store sufficiently precise information to enable new applications previously thought to be infeasible,&#8221; the program authors wrote. AT&amp;T uses Hancock code to sift 9 GB of telephone traffic data a night, according to the paper.<\/p>\n<p>The good news for budding data miners is that Hancock&#8217;s <a href=\"http:\/\/www.research.att.com\/~kfisher\/hancock\/\">source code and binaries<\/a> (now up to version 2.0) are available free to noncommercial users from an AT&amp;T Research website.<\/p>\n<p>The <a href=\"http:\/\/www.research.att.com\/~kfisher\/hancock\/manual.pdf\">instruction manual<\/a> (.pdf) is also free, and old-timers will appreciate its spare Kernighan &amp; Ritchie style. The manual even includes a few sample programs in the style of K&amp;R&#8217;s Hello World, but coded specifically to handle data collected by AT&amp;T&#8217;s phone and internet switches. This one reads in a dump of internet headers, computes what IP addresses were visited, makes a record and prints them out, in less than 40 lines of code.<\/p>\n<blockquote>\n<pre>#include \"ipRec.hh\" \r\n#include \"ihash.h\" \r\n\r\nhash_table *ofInterest; \r\n\r\nint inSet (ipPacket_t * p) \r\n{ \r\n if (hash_get (ofInterest, p-&gt;source.hash_value) == 1) \r\n  return 1; \r\n if (hash_get (ofInterest, p-&gt;dest.hash_value) == 1) \r\n  return 1; \r\n return 0; \r\n} \r\nvoid <strong>sig_main<\/strong> (ipAddr_s addrs &lt; l:&gt;, \r\n{ \r\n \/* code to set up hash table *\/ \r\n ofInterest = hash_empty (); \r\n <strong>iterate<\/strong> \r\n  (<strong>over<\/strong> addrs) { \r\n  <strong>event<\/strong> (ipAddr_t * addr) { \r\n \u00a0 \u00a0if (hash_insert (ofInterest, addr-&gt;hash_value, 1) &lt; 0) \r\n  } \r\n } \r\n \/* code to select packets *\/ \r\n <strong>iterate<\/strong> \r\n  (<strong>over<\/strong> packets \r\n \u00a0 filteredby inSet) \r\n { \r\n  <strong>event<\/strong> (ipPacket_t * p) \r\n  { \r\n \u00a0 \u00a0printPacketInfo (p); \r\n  } \r\n }; \r\n}<\/pre>\n<\/blockquote>\n<p>Another sample program included in the manual shows how a Hancock program could create historical maps of a person&#8217;s travels by recording nightly what cell phone towers a person&#8217;s phone had used or pinged throughout a day.<\/p>\n<p>AT&amp;T is currently defending itself in federal court from allegations that it installed, on behalf of the NSA, secret internet spying rooms in its domestic internet switching facilities. AT&amp;T and Verizon are also accused of giving the NSA access to billions of Americans&#8217; phone records, in order to data-mine them to spot suspected terrorists, and presumably to identify targets for warrantless wiretapping.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>According to government documents studied by The New York Times, the FBI asked several phone companies to analyze phone-call patterns of Americans using a technology called \u201ccommunities of interest\u201d. Verizon refused, saying that it didn\u2019t have any such technology. AT&amp;T, famously, did not refuse.What is the \u201ccommunities of interest\u201d technology? It\u2019s spelled out very clearly [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1614],"tags":[30],"class_list":{"0":"post-1613","1":"post","2":"type-post","3":"status-publish","4":"format-standard","6":"category-surveillance-big-brother","7":"tag-big-brother"},"_links":{"self":[{"href":"http:\/\/rinf.com\/alt-news\/wp-json\/wp\/v2\/posts\/1613","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/rinf.com\/alt-news\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/rinf.com\/alt-news\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/rinf.com\/alt-news\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/rinf.com\/alt-news\/wp-json\/wp\/v2\/comments?post=1613"}],"version-history":[{"count":0,"href":"http:\/\/rinf.com\/alt-news\/wp-json\/wp\/v2\/posts\/1613\/revisions"}],"wp:attachment":[{"href":"http:\/\/rinf.com\/alt-news\/wp-json\/wp\/v2\/media?parent=1613"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/rinf.com\/alt-news\/wp-json\/wp\/v2\/categories?post=1613"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/rinf.com\/alt-news\/wp-json\/wp\/v2\/tags?post=1613"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}