<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article
  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="3.0" xml:lang="EN">
<front>
<journal-meta><journal-id journal-id-type="nlm-ta">PLoS ONE</journal-id><journal-id journal-id-type="publisher-id">plos</journal-id><journal-id journal-id-type="pmc">plosone</journal-id><!--===== Grouping journal title elements =====--><journal-title-group><journal-title>PLoS ONE</journal-title></journal-title-group><issn pub-type="epub">1932-6203</issn><publisher>
<publisher-name>Public Library of Science</publisher-name>
<publisher-loc>San Francisco, USA</publisher-loc></publisher></journal-meta>
<article-meta><article-id pub-id-type="publisher-id">10-PONE-RA-21553R1</article-id><article-id pub-id-type="doi">10.1371/journal.pone.0014248</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="Discipline"><subject>Computer Science/Applications</subject><subject>Mathematics/Algorithms</subject><subject>Physics/Interdisciplinary Physics</subject></subj-group></article-categories><title-group><article-title>Redrawing the Map of Great Britain from a Network of Human Interactions</article-title><alt-title alt-title-type="running-head">Borderline</alt-title></title-group><contrib-group>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Ratti</surname><given-names>Carlo</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Sobolevsky</surname><given-names>Stanislav</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Calabrese</surname><given-names>Francesco</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Andris</surname><given-names>Clio</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Reades</surname><given-names>Jonathan</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="aff" rid="aff2"><sup>2</sup></xref></contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Martino</surname><given-names>Mauro</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Claxton</surname><given-names>Rob</given-names></name><xref ref-type="aff" rid="aff3"><sup>3</sup></xref></contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Strogatz</surname><given-names>Steven H.</given-names></name><xref ref-type="aff" rid="aff4"><sup>4</sup></xref></contrib>
</contrib-group><aff id="aff1"><label>1</label><addr-line>Senseable City Lab, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America</addr-line>       </aff><aff id="aff2"><label>2</label><addr-line>Centre for Advanced Spatial Analysis, University College London, London, United Kingdom</addr-line>       </aff><aff id="aff3"><label>3</label><addr-line>BT Group, Ipswich, United Kingdom</addr-line>       </aff><aff id="aff4"><label>4</label><addr-line>Department of Mathematics, Cornell University, Ithaca, New York, United States of America</addr-line>       </aff><contrib-group>
<contrib contrib-type="editor" xlink:type="simple"><name name-style="western"><surname>Sporns</surname><given-names>Olaf</given-names></name>
<role>Editor</role>
<xref ref-type="aff" rid="edit1"/></contrib>
</contrib-group><aff id="edit1">Indiana University, United States of America</aff><author-notes>
<corresp id="cor1">* E-mail: <email xlink:type="simple">fcalabre@mit.edu</email></corresp>
<fn fn-type="con"><p>Conceived and designed the experiments: CR S. Sobolevsky FC CA JR MM RC S. Strogatz. Performed the experiments: CR S. Sobolevsky FC. Analyzed the data: CR S. Sobolevsky FC CA JR RC S. Strogatz. Contributed reagents/materials/analysis tools: CR S. Sobolevsky FC MM S. Strogatz. Wrote the paper: CR S. Sobolevsky FC JR S. Strogatz.</p></fn>
<fn fn-type="conflict"><p>Rob Claxton is employed by BT Group plc. This affiliation does not alter the authors' adherence to all PLoS ONE policies on the sharing of data and materials.</p></fn></author-notes><pub-date pub-type="collection"><year>2010</year></pub-date><pub-date pub-type="epub"><day>8</day><month>12</month><year>2010</year></pub-date><volume>5</volume><issue>12</issue><elocation-id>e14248</elocation-id><history>
<date date-type="received"><day>23</day><month>7</month><year>2010</year></date>
<date date-type="accepted"><day>4</day><month>11</month><year>2010</year></date>
</history><!--===== Grouping copyright info into permissions =====--><permissions><copyright-year>2010</copyright-year><copyright-holder>Ratti et al</copyright-holder><license><license-p>This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original and source are credited.</license-p></license></permissions><abstract>
<p>Do regional boundaries defined by governments respect the more natural ways that people interact across space? This paper proposes a novel, fine-grained approach to regional delineation, based on analyzing networks of billions of individual human transactions. Given a geographical area and some measure of the strength of links between its inhabitants, we show how to partition the area into smaller, non-overlapping regions while minimizing the disruption to each person's links. We tested our method on the largest non-Internet human network, inferred from a large telecommunications database in Great Britain. Our partitioning algorithm yields geographically cohesive regions that correspond remarkably well with administrative regions, while unveiling unexpected spatial structures that had previously only been hypothesized in the literature. We also quantify the effects of partitioning, showing for instance that the effects of a possible secession of Wales from Great Britain would be twice as disruptive for the human network than that of Scotland.</p>
</abstract><funding-group><funding-statement>The authors were partially funded by the AT&amp;T Foundation, the National Science Foundation, the National Defense Science and Engineering Fellowship Program, and Audi Volkswagen. Rob Claxton was funded by BT Group plc, which contributed to data collection and had no role in study design, data analysis, decision to publish, or preparation of the manuscript. The other funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</funding-statement></funding-group><counts><page-count count="6"/></counts></article-meta>
</front>
<body><sec id="s1">
<title>Introduction</title>
<p>Do regional boundaries defined by governments respect the more natural ways that people interact across space? Beyond its fundamental importance in economic geography <xref ref-type="bibr" rid="pone.0014248-Lsch1">[1]</xref>–<xref ref-type="bibr" rid="pone.0014248-Coombes1">[4]</xref>, this question underlies many conflicts and struggles for regional independence across the world, such as those that have been recorded across parts of Great Britain over the past decades. To estimate the strength of inter- and intra-regional transactions, traditional analyses have relied on aggregate parameters such as local labour market data, commuter or travel flows and other indexes of accessibility and socioeconomic status <xref ref-type="bibr" rid="pone.0014248-Barkley1">[5]</xref>–<xref ref-type="bibr" rid="pone.0014248-Killian1">[9]</xref>. Here we propose a new, more fine-grained approach to regional delineation, based on analyzing networks of billions of individual human transactions that have recently become available <xref ref-type="bibr" rid="pone.0014248-Lazer1">[10]</xref>. Given a geographical area and some measure of the strength of links between its inhabitants, we show how to partition the area into smaller, non-overlapping regions while minimizing the disruption to each person's links. We tested our method on the largest non-Internet human network, composed of 20.8 million nodes inferred from a large telecommunications database in Great Britain <xref ref-type="bibr" rid="pone.0014248-Johnston1">[11]</xref>, <xref ref-type="bibr" rid="pone.0014248-Karlsson1">[12]</xref>. Our partitioning algorithm yields geographically cohesive regions that correspond remarkably well with traditional maps and with existing commuting and administrative data. The most striking differences are that Wales and parts of Yorkshire become merged into regions dominated by the major cities of the West and East Midlands, respectively. Our approach could be extended to other large-scale data sets arising in economic geography, urban planning and transportation studies, potentially creating a new type of regional analysis that more closely reflects patterns of human interaction.</p>
<p>We started with a telephone data set containing 12 billion calls over a one-month period, estimating more than 95% coverage of the Great Britain's residential and business landlines in that quarter. Using these data and the methodology explained in <xref ref-type="supplementary-material" rid="pone.0014248.s002">Text S1</xref>, we inferred a network of roughly 20.8×10<sup>6</sup> nodes and 85.8×10<sup>6</sup> undirected links. To safeguard personal privacy, individual phone numbers were anonymized by the operator before leaving storage facilities. Also, each caller's geographic location was specified at the level of spatial units based on a geographic agglomeration of sub-regional switching facility groups (covering 49 km<sup>2</sup> on average). Thus the geographic agglomeration acts as a kind of mask, preventing us from being able to pinpoint a customer's address, neighbourhood or village.</p>
<p>We assumed that the above network is a measure of human interactions at an individual level over all of Great Britain (see discussion in <xref ref-type="supplementary-material" rid="pone.0014248.s002">Text S1</xref> and below) and aggregated it into a grid of 3,042 square pixels, each with dimensions 9.5 km by 9.5 km. We treated each pixel as a spatial node and measured its connection strength to every other pixel, thereby deriving a matrix of the total bidirectional traffic between each pair of spatial nodes in the geographic network (<xref ref-type="fig" rid="pone-0014248-g001">Fig. 1</xref>). The resulting network of telephone traffic gives an indication of how tightly the thousands of different parts of Great Britain are connected, pixel by pixel. Please note that connection strength was calculated using total call time, hence taking into account the local population density.</p>
<fig id="pone-0014248-g001" position="float"><object-id pub-id-type="doi">10.1371/journal.pone.0014248.g001</object-id><label>Figure 1</label><caption>
<title>The geography of talk in Great Britain.</title>
<p>This figure shows the strongest 80% of links, as measured by total talk time, between areas within Britain. The opacity of each link is proportional to the total call time between two areas and the different colours represent regions identified using network modularity optimisation analysis.</p>
</caption><graphic mimetype="image" position="float" xlink:href="info:doi/10.1371/journal.pone.0014248.g001" xlink:type="simple"/></fig></sec><sec id="s2">
<title>Results and Discussion</title>
<p>The question naturally arises: What is the best way to group these pixels into larger regions? A similar question has been a focus of network research over the past decade; there one seeks the best way to partition a network into separate, non-overlapping communities <xref ref-type="bibr" rid="pone.0014248-SalesPardo1">[13]</xref>–<xref ref-type="bibr" rid="pone.0014248-Porter1">[18]</xref>. The leading approach is based on optimizing the network's “modularity” <xref ref-type="bibr" rid="pone.0014248-Newman1">[15]</xref>. High modularity values occur when the network is subdivided such that there are many links within communities and few between them, as compared to a randomly generated network with otherwise similar characteristics.</p>
<p>However, we are not trying to partition the network itself, but rather to use the network's characteristics to partition the geographic space underneath the network's topology while guaranteeing spatial adjacency, one of the essential features of a geographic region.</p>
<p>Nonetheless, we felt it might be instructive to ignore the adjacency constraint initially, to see what sorts of regions would be obtained. Following Newman's approach as a baseline, we applied his spectral optimization algorithm <xref ref-type="bibr" rid="pone.0014248-Newman2">[16]</xref>. Note that it was important to include loop edges (as proposed in <xref ref-type="bibr" rid="pone.0014248-Arenas1">[19]</xref>) in our analysis as it allowed us to correctly represent the human network from which we started (see <xref ref-type="supplementary-material" rid="pone.0014248.s003">Text S2</xref>).</p>
<p>After two iterations of the algorithm, a surprisingly accurate map of the Greater London region emerged, along with an area corresponding to Scotland, with just a few detached pixels scattered across the rest of Great Britain (<xref ref-type="fig" rid="pone-0014248-g002">Fig. 2 (a) and (b)</xref>).</p>
<fig id="pone-0014248-g002" position="float"><object-id pub-id-type="doi">10.1371/journal.pone.0014248.g002</object-id><label>Figure 2</label><caption>
<title>Defining regions through the spectral modularity optimization of telecommunications networks.</title>
<p>a - even with just three regions we obtain a total modularity of 0.31, indicating a fairly good network partitioning. b - the final partitioning of Great Britain yields a modularity of 0.58. c - further fine tuning according to the process suggested by Newman <xref ref-type="bibr" rid="pone.0014248-Newman2">[16]</xref> increases the modularity to 0.60.</p>
</caption><graphic mimetype="image" position="float" xlink:href="info:doi/10.1371/journal.pone.0014248.g002" xlink:type="simple"/></fig>
<p>With subsequent iterations the modularity increased, ultimately converging to a maximum of 0.58, indicative of a good partitioning compared to the randomized network, as mentioned in <xref ref-type="bibr" rid="pone.0014248-Newman1">[15]</xref>, <xref ref-type="bibr" rid="pone.0014248-Guimera1">[20]</xref>. The resulting subdivision had 23 communities, 13 of which were clearly delineated geographically, although some scattered pixels and fuzzy boundaries remained. To determine if these artefacts were due to noise produced by the heuristics of spectral partitioning, we next fine-tuned the spectral partitioning algorithm in a manner suggested by Newman <xref ref-type="bibr" rid="pone.0014248-Newman2">[16]</xref>, iteratively moving pixels from one region to another to maximize overall modularity (see <xref ref-type="supplementary-material" rid="pone.0014248.s004">Text S3</xref>). When applied to our data, this process removed the fuzzy boundaries, attached the scattered pixels to their nearest neighbours, and increased the modularity to 0.60.</p>
<p><xref ref-type="fig" rid="pone-0014248-g002">Figure 2(c)</xref> shows the resulting map. Its regional cohesiveness is unexpected: we began by looking at the human network as a topological entity with no geographical constraints, but uncovered clear regions in space that respect spatial adjacency. Apparently the telecommunication links between individuals—and the interpersonal transactions that they capture—are so intertwined with geographical space that partitioning at a network-topological level produces a very accurate partitioning of geographic space. Compared to previously suggested distance-decay models of telecommunication in space <xref ref-type="bibr" rid="pone.0014248-Zipf1">[21]</xref>–<xref ref-type="bibr" rid="pone.0014248-LibenNowell1">[24]</xref>, our technique for partitioning shows that not only population distribution in space but also regional boundaries affect the patterns of communication. They also seem to confirm the spatial cohesiveness of partitions defined on mobility networks at an aggregate level, such as airplane connections and banknote movement <xref ref-type="bibr" rid="pone.0014248-Guimer1">[25]</xref>, <xref ref-type="bibr" rid="pone.0014248-Thiemann1">[26]</xref>.</p>
<p>Before embarking on the detailed examination of our regions, however, we should check how stable our boundaries are. As it has been shown <xref ref-type="bibr" rid="pone.0014248-SalesPardo1">[13]</xref>, <xref ref-type="bibr" rid="pone.0014248-Good1">[27]</xref>, a modularity function such as ours is likely to have exponentially many local maxima, and these maxima typically have different clustered structures. Our partition is likely not to be the global maximum and there are probably alternative local maxima with a high modularity score. What would the corresponding boundaries be? To find out we implemented several modularity partitioning methods (see especially <xref ref-type="supplementary-material" rid="pone.0014248.s001">Figure S1</xref> and <xref ref-type="supplementary-material" rid="pone.0014248.s004">Text S3</xref>). The results are reassuring: there is indeed some variation along the boundaries, but we always find cohesive regions centred approximately in the same place. Also, if we intersect all regions obtained with the different methods, we find 11 stable “cores” that are always separated from each other by “peripheral” regions that lie at the boundaries and have somewhat ambiguous associations (<xref ref-type="fig" rid="pone-0014248-g003">Fig. 3</xref>). It should be noted that these “cores” highlight very densely populated areas and contain the great majority of Great Britain's population (85%). Conversely the peripheral regions are very sparsely inhabited. The regional partitioning is also robust with respect to uncertainty in the data, as proven by subsampling (see <xref ref-type="supplementary-material" rid="pone.0014248.s005">Text S4</xref>), and seems indicative of a highly modular network <xref ref-type="bibr" rid="pone.0014248-Guimera1">[20]</xref>, <xref ref-type="bibr" rid="pone.0014248-Good1">[27]</xref>, as seen by comparison with many null models that have an average modularity score of less than 0.02 (see <xref ref-type="supplementary-material" rid="pone.0014248.s006">Text S5</xref>). We recognize the limits of resolution due to the modularity definition <xref ref-type="bibr" rid="pone.0014248-Fortunato2">[28]</xref>. As we are interested in detecting large regions comparable to the official administrative ones, our analysis did not suffer of this issue. However, multi-resolutions methods could be used to detect smaller robust communities (see <xref ref-type="bibr" rid="pone.0014248-Fortunato1">[17]</xref>).</p>
<fig id="pone-0014248-g003" position="float"><object-id pub-id-type="doi">10.1371/journal.pone.0014248.g003</object-id><label>Figure 3</label><caption>
<title>The core regions of Britain.</title>
<p>By combining the output from several modularity optimization methods we obtain the results shown in this figure. The thick black boundary lines show the official Government Office Regions partitioning together with Scotland and Wales. The black background spots show Britain's towns and cities, some of which are highlighted with a label.</p>
</caption><graphic mimetype="image" position="float" xlink:href="info:doi/10.1371/journal.pone.0014248.g003" xlink:type="simple"/></fig>
<p>Another interesting point is that the core map based on human interactions divides Great Britain into approximately the number of “official” Nomenclature of Territorial Units for Statistics 1 (NUTS) British regions (11) —with boundaries that approximately coincide with the traditional ones (<xref ref-type="fig" rid="pone-0014248-g003">Fig. 3</xref>). Many of the telecom regions—those corresponding to Scotland, South West, London and the East of England—closely match the forms of historically and administratively important regions. In fact, on average about 80% of pixels fall within a corresponding (by largest overlap) telecom region. While not surprising, this finding seems to corroborate our method: we would indeed expect an agreement between the administrative boundaries and those found from human interaction, as they probably evolved together, over many centuries of mutual interplay—cohesive patterns within society promoting change in administrative boundaries and the latter, in turn, affecting human interaction.</p>
<p>The most obvious difference between the two maps is that Wales, and to a lesser extent Yorkshire, seem to have been incorporated into regions dominated by the major cities of the West and East Midlands regions, respectively. Moreover, we have also “found” a new region developing to the west of London. The first finding supports hypotheses that have long circulated in the transport and regional studies literature: detailed commuting data from the 2001 census was used to generate regions where 95% of trips are internal to that region, finding that Wales, in spite of its unique cultural and linguistic heritage, is well integrated with its English neighbours to the East <xref ref-type="bibr" rid="pone.0014248-Nielsen1">[29]</xref>. Also, the resulting northern and southern Welsh regions match extremely well with our maps. The second finding, of a new region just west of London, corroborates an earlier study of a ‘Western Crescent’ of high-tech activity <xref ref-type="bibr" rid="pone.0014248-Hall1">[30]</xref>: a cohesive area that generally scores extremely well in measures of economic activity and low levels of deprivation, as measured by Gross Value Added (GVA) and qualifications (NVQ) for Berkshire, Buckinghamshire, and Oxfordshire <xref ref-type="bibr" rid="pone.0014248-UK1">[31]</xref>. Our partitioning, in short, seems to capture human interaction more accurately than the official NUTS regions. We also overlaid a map of modern English-only dialects <xref ref-type="bibr" rid="pone.0014248-Trudgill1">[32]</xref>. Even if the boundaries of 16 dialects were not well defined, we could informally estimate that the East of England and, in particular, East Anglia matched up fairly well with our corresponding region (around 60% overlap between regions), although the overall overlap between telecom regions and corresponding - by largest overlap - dialects regions was only around 25%. This is what we would expect in country that has undergone centuries of linguistic integration.</p>
<p>There are other metrics for which the partitioning scores better than NUTS. Per our initial hypothesis our regions would produce fewer disturbances to the network of human interaction. This can be seen in <xref ref-type="supplementary-material" rid="pone.0014248.s004">Text S3</xref>, where we show that boundaries obtained with all modularity partitioning methods always cut fewer ties across the network. Another measure by which our partitioning scores better is that our predicted boundaries cross areas with very low population density (50% that of the official boundaries).</p>
<p>The above partitioning of Great Britain using telecommunication data also suggests the extent to which each region is integrated into the country as a whole. To measure this, we calculate the call time ratio, defined as the percentage of time a region talks to itself. By this measure, Scotland is the region least connected to the rest of Great Britain, followed by North Wales, South Wales and Greater London. What is particularly striking about Scotland is that the call time ratio is 76.7%, meaning that just 23.3% of all call time placed or received in Scotland goes to or comes from another part of the country (as a comparison in a random network we would have only 37% call time ratio). Scotland appears to be loosely coupled with the rest of Great Britain in a way that Wales emphatically is not. In other terms, if Scotland and Wales were to become independent from the UK, and if the detrimental effect of the secession were considered proportional to the number of external connections, the effect on people would be approximately twice more disruptive on Wales than Scotland.</p>
<p>All of the above analysis is based on the pattern of landline calls, but our method could easily be used on other networks in the future: data from mobile phones could be an indicator of more personal (as opposed to household and business-oriented) human interaction <xref ref-type="bibr" rid="pone.0014248-Onnela1">[33]</xref>, while databases from credit card companies could highlight commercial links between individuals. One could even imagine applying a similar analysis to the movement patterns of each individual, and determine boundaries that would minimize their disturbance <xref ref-type="bibr" rid="pone.0014248-Brockmann1">[34]</xref>–<xref ref-type="bibr" rid="pone.0014248-Wang1">[36]</xref>. All together, these approaches could lead to a new perspective in regional studies, transportation planning and economic geography.</p>
</sec><sec id="s3">
<title>Supporting Information</title>
<supplementary-material id="pone.0014248.s001" mimetype="image/tiff" position="float" xlink:href="info:doi/10.1371/journal.pone.0014248.s001" xlink:type="simple"><label>Figure S1</label><caption>
<p>Defining regions through the spectral modularity optimization. Results of five different modularity optimization algorithms.</p>
<p>(5.66 MB TIF)</p>
</caption></supplementary-material><supplementary-material id="pone.0014248.s002" mimetype="application/msword" position="float" xlink:href="info:doi/10.1371/journal.pone.0014248.s002" xlink:type="simple"><label>Text S1</label><caption>
<p>Inferring the network of human interactions from calling data.</p>
<p>(0.05 MB DOC)</p>
</caption></supplementary-material><supplementary-material id="pone.0014248.s003" mimetype="application/msword" position="float" xlink:href="info:doi/10.1371/journal.pone.0014248.s003" xlink:type="simple"><label>Text S2</label><caption>
<p>Definition of modularity.</p>
<p>(0.19 MB DOC)</p>
</caption></supplementary-material><supplementary-material id="pone.0014248.s004" mimetype="application/msword" position="float" xlink:href="info:doi/10.1371/journal.pone.0014248.s004" xlink:type="simple"><label>Text S3</label><caption>
<p>Comparing different modularity optimization methods.</p>
<p>(0.06 MB DOC)</p>
</caption></supplementary-material><supplementary-material id="pone.0014248.s005" mimetype="application/msword" position="float" xlink:href="info:doi/10.1371/journal.pone.0014248.s005" xlink:type="simple"><label>Text S4</label><caption>
<p>Subsampling the network data.</p>
<p>(0.04 MB DOC)</p>
</caption></supplementary-material><supplementary-material id="pone.0014248.s006" mimetype="application/msword" position="float" xlink:href="info:doi/10.1371/journal.pone.0014248.s006" xlink:type="simple"><label>Text S5</label><caption>
<p>Comparison with null model.</p>
<p>(0.04 MB DOC)</p>
</caption></supplementary-material></sec></body>
<back>
<ack>
<p>The authors thank the BT Group, the National Science Foundation, the AT&amp;T Foundation, the National Defense Science and Engineering Fellowship Program, the MIT SMART program, GE, Audi Volkswagen, SNCF, ENEL and the members of the MIT Senseable City Lab Consortium for supporting the research. Janet Owers provided expert editorial guidance.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="pone.0014248-Lsch1"><label>1</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Lösch</surname><given-names>A</given-names></name>
</person-group>             <year>1938</year>             <article-title>The nature of economic regions.</article-title>             <source>South Econ J</source>             <volume>5</volume>             <fpage>71</fpage>             <lpage>78</lpage>          </element-citation></ref>
<ref id="pone.0014248-Christaller1"><label>2</label><element-citation publication-type="other" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Christaller</surname><given-names>W</given-names></name>
</person-group>             <year>1933</year>             <article-title>Central Places in Southern Germany.</article-title>             <source>Prentice-Hall, Englewood Cliffs</source>          </element-citation></ref>
<ref id="pone.0014248-Fujita1"><label>3</label><element-citation publication-type="other" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Fujita</surname><given-names>M</given-names></name>
<name name-style="western"><surname>Krugman</surname><given-names>PR</given-names></name>
<name name-style="western"><surname>Venables</surname><given-names>AJ</given-names></name>
</person-group>             <year>2001</year>             <article-title>The Spatial Economy: Cities, Regions and International Trade.</article-title>             <publisher-loc>Cambridge</publisher-loc>             <publisher-name>MIT Press</publisher-name>          </element-citation></ref>
<ref id="pone.0014248-Coombes1"><label>4</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Coombes</surname><given-names>M</given-names></name>
</person-group>             <year>2000</year>             <article-title>Defining locality boundaries with synthetic data.</article-title>             <source>Environ Plann A</source>             <volume>32</volume>             <fpage>1499</fpage>             <lpage>1518</lpage>          </element-citation></ref>
<ref id="pone.0014248-Barkley1"><label>5</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Barkley</surname><given-names>DL</given-names></name>
<name name-style="western"><surname>Henry</surname><given-names>MS</given-names></name>
<name name-style="western"><surname>Bao</surname><given-names>S</given-names></name>
</person-group>             <year>1996</year>             <article-title>Identifying “spread” versus “backwash” effects in regional economic areas: A density functions approach.</article-title>             <source>Land Econ</source>             <volume>72</volume>             <fpage>336</fpage>             <lpage>357</lpage>          </element-citation></ref>
<ref id="pone.0014248-Micklander1"><label>6</label><element-citation publication-type="other" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Micklander</surname><given-names>Å</given-names></name>
</person-group>             <year>1971</year>             <article-title>Commuting and Commuting Areas.</article-title>             <publisher-loc>Stockholm</publisher-loc>             <publisher-name>Allmänna Förlaget</publisher-name>          </element-citation></ref>
<ref id="pone.0014248-Brown1"><label>7</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Brown</surname><given-names>LA</given-names></name>
<name name-style="western"><surname>Holmes</surname><given-names>J</given-names></name>
</person-group>             <year>1971</year>             <article-title>The delimitation of functional regions, nodal regions, and hierarchies by functional distance approaches.</article-title>             <source>Jof Reg Sci</source>             <volume>11</volume>             <fpage>57</fpage>             <lpage>72</lpage>          </element-citation></ref>
<ref id="pone.0014248-Hemmasi1"><label>8</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Hemmasi</surname><given-names>M</given-names></name>
</person-group>             <year>1980</year>             <article-title>The identification of functional regions based on lifetime migration data: a case study of Iran.</article-title>             <source>Econ Geogr</source>             <volume>56</volume>             <fpage>223</fpage>             <lpage>233</lpage>          </element-citation></ref>
<ref id="pone.0014248-Killian1"><label>9</label><element-citation publication-type="other" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Killian</surname><given-names>MS</given-names></name>
<name name-style="western"><surname>Tolbert</surname><given-names>CM</given-names></name>
<name name-style="western"><surname>Singelmann</surname><given-names>J</given-names></name>
<name name-style="western"><surname>Desaran</surname><given-names>FA</given-names></name>
</person-group>             <year>1993</year>             <article-title>Mapping social and economic space: the delineation of local labour markets in the United States.</article-title>             <source>Inequalities in Labour Market Areas</source>             <publisher-name>Westview, Boulder</publisher-name>          </element-citation></ref>
<ref id="pone.0014248-Lazer1"><label>10</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Lazer</surname><given-names>D</given-names></name>
<name name-style="western"><surname>Pentland</surname><given-names>A</given-names></name>
<name name-style="western"><surname>Adamic</surname><given-names>L</given-names></name>
<name name-style="western"><surname>Aral</surname><given-names>S</given-names></name>
<name name-style="western"><surname>Barabási</surname><given-names>A-L</given-names></name>
<etal/></person-group>             <year>2009</year>             <article-title>Computational social science.</article-title>             <source>Science</source>             <volume>323</volume>             <fpage>721</fpage>             <lpage>723</lpage>          </element-citation></ref>
<ref id="pone.0014248-Johnston1"><label>11</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Johnston</surname><given-names>KP</given-names></name>
</person-group>             <year>1995</year>             <article-title>Redefinition of the BEA Economic Areas.</article-title>             <source>Surv Curr Bus</source>             <volume>84</volume>             <fpage>75</fpage>             <lpage>81</lpage>          </element-citation></ref>
<ref id="pone.0014248-Karlsson1"><label>12</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Karlsson</surname><given-names>C</given-names></name>
<name name-style="western"><surname>Olsson</surname><given-names>M</given-names></name>
</person-group>             <year>2006</year>             <article-title>The identification of functional regions: theory, methods, and applications.</article-title>             <source>Ann Reg Sci</source>             <volume>40</volume>             <fpage>1</fpage>             <lpage>18</lpage>          </element-citation></ref>
<ref id="pone.0014248-SalesPardo1"><label>13</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Sales-Pardo</surname><given-names>M</given-names></name>
<name name-style="western"><surname>Guimerà</surname><given-names>R</given-names></name>
<name name-style="western"><surname>Moreira</surname><given-names>AA</given-names></name>
<name name-style="western"><surname>Amaral</surname><given-names>LA</given-names></name>
</person-group>             <year>2007</year>             <article-title>Extracting the hierarchical organization of complex systems.</article-title>             <source>Proceedings of the National Academy of Sciences</source>             <volume>104</volume>             <fpage>15224</fpage>             <lpage>15229</lpage>          </element-citation></ref>
<ref id="pone.0014248-Ziv1"><label>14</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Ziv</surname><given-names>E</given-names></name>
<name name-style="western"><surname>Middendorf</surname><given-names>M</given-names></name>
<name name-style="western"><surname>Wiggins</surname><given-names>CH</given-names></name>
</person-group>             <year>2005</year>             <article-title>Information-theoretic approach to network modularity.</article-title>             <source>Phys Rev E</source>             <volume>71</volume>             <fpage>046117</fpage>          </element-citation></ref>
<ref id="pone.0014248-Newman1"><label>15</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Newman</surname><given-names>MEJ</given-names></name>
</person-group>             <year>2004</year>             <article-title>Detecting community structure in networks.</article-title>             <source>Eur Phys J B</source>             <volume>38</volume>             <fpage>321</fpage>             <lpage>330</lpage>          </element-citation></ref>
<ref id="pone.0014248-Newman2"><label>16</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Newman</surname><given-names>MEJ</given-names></name>
</person-group>             <year>2006</year>             <article-title>Modularity and community structure in networks.</article-title>             <source>Proceedings of the National Academy of Sciences</source>             <volume>103</volume>             <fpage>8577</fpage>             <lpage>8582</lpage>          </element-citation></ref>
<ref id="pone.0014248-Fortunato1"><label>17</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Fortunato</surname><given-names>S</given-names></name>
</person-group>             <year>2010</year>             <article-title>Community detection in graphs.</article-title>             <source>Physics Reports</source>             <volume>486</volume>             <issue>3-5</issue>             <fpage>75</fpage>             <lpage>174</lpage>          </element-citation></ref>
<ref id="pone.0014248-Porter1"><label>18</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Porter</surname><given-names>AP</given-names></name>
<name name-style="western"><surname>Onnela</surname><given-names>LP</given-names></name>
<name name-style="western"><surname>Mucha</surname><given-names>PJ</given-names></name>
</person-group>             <year>2009</year>             <article-title>Communities in networks.</article-title>             <source>Notices of the American Mathematical Society</source>             <volume>56</volume>             <issue>9</issue>             <fpage>1082</fpage>             <lpage>1097</lpage>          </element-citation></ref>
<ref id="pone.0014248-Arenas1"><label>19</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Arenas</surname><given-names>A</given-names></name>
<name name-style="western"><surname>Dutch</surname><given-names>J</given-names></name>
<name name-style="western"><surname>Fernandez</surname><given-names>A</given-names></name>
<name name-style="western"><surname>Gomez</surname><given-names>S</given-names></name>
</person-group>             <year>2007</year>             <article-title>Size reduction of complex networks preserving modularity.</article-title>             <source>New Journal of Physics</source>             <volume>9</volume>             <issue>176</issue>          </element-citation></ref>
<ref id="pone.0014248-Guimera1"><label>20</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Guimera</surname><given-names>R</given-names></name>
<name name-style="western"><surname>Sales-Pardo</surname><given-names>M</given-names></name>
<name name-style="western"><surname>Amaral</surname><given-names>LAN</given-names></name>
</person-group>             <year>2004</year>             <article-title>Modularity from fluctuations in random graphs and complex networks.</article-title>             <source>Phys Rev E</source>             <volume>70</volume>             <fpage>025101(R)</fpage>          </element-citation></ref>
<ref id="pone.0014248-Zipf1"><label>21</label><element-citation publication-type="other" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Zipf</surname><given-names>G</given-names></name>
</person-group>             <year>1949</year>             <article-title>The Economy of Geography.</article-title>             <publisher-loc>Germany</publisher-loc>             <publisher-name>Addison-Wesley</publisher-name>          </element-citation></ref>
<ref id="pone.0014248-Lambiotte1"><label>22</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Lambiotte</surname><given-names>R</given-names></name>
<etal/></person-group>             <year>2008</year>             <article-title>Geographical dispersal of mobile communication networks.</article-title>             <source>Physica A</source>             <volume>387</volume>             <fpage>5317</fpage>             <lpage>5325</lpage>          </element-citation></ref>
<ref id="pone.0014248-Krings1"><label>23</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Krings</surname><given-names>G</given-names></name>
<name name-style="western"><surname>Calabrese</surname><given-names>F</given-names></name>
<name name-style="western"><surname>Ratti</surname><given-names>C</given-names></name>
<name name-style="western"><surname>Blondel</surname><given-names>VD</given-names></name>
</person-group>             <year>2009</year>             <article-title>Urban gravity: a model for inter-city telecommunication flows.</article-title>             <source>J Stat Mech Theory and Experiment</source>             <volume>07</volume>             <fpage>1</fpage>             <lpage>8</lpage>          </element-citation></ref>
<ref id="pone.0014248-LibenNowell1"><label>24</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Liben-Nowell</surname><given-names>D</given-names></name>
<name name-style="western"><surname>Novak</surname><given-names>J</given-names></name>
<name name-style="western"><surname>Kumar</surname><given-names>R</given-names></name>
<name name-style="western"><surname>Raghavan</surname><given-names>P</given-names></name>
<name name-style="western"><surname>Tomkins</surname><given-names>A</given-names></name>
</person-group>             <year>2005</year>             <article-title>Geographic routing in social networks.</article-title>             <source>Proceedings of the National Academy of Sciences</source>             <volume>102</volume>             <fpage>11623</fpage>             <lpage>11628</lpage>          </element-citation></ref>
<ref id="pone.0014248-Guimer1"><label>25</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Guimerà</surname><given-names>R</given-names></name>
<name name-style="western"><surname>Mossa</surname><given-names>S</given-names></name>
<name name-style="western"><surname>Turtschi</surname><given-names>A</given-names></name>
<name name-style="western"><surname>Amaral</surname><given-names>LAN</given-names></name>
</person-group>             <year>2005</year>             <article-title>The worldwide air transportation network: Anomalous centrality, community structure, and cities' global roles.</article-title>             <source>Proceedings of the National Academy of Sciences of the United States of America</source>             <volume>102</volume>             <fpage>7794</fpage>             <lpage>7799</lpage>          </element-citation></ref>
<ref id="pone.0014248-Thiemann1"><label>26</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Thiemann</surname><given-names>C</given-names></name>
<name name-style="western"><surname>Theis</surname><given-names>F</given-names></name>
<name name-style="western"><surname>Grady</surname><given-names>D</given-names></name>
<name name-style="western"><surname>Brune</surname><given-names>R</given-names></name>
<name name-style="western"><surname>Brockmann</surname><given-names>D</given-names></name>
</person-group>             <year>2010</year>             <article-title>The structure of borders in a small world.</article-title>             <comment>Available: <ext-link ext-link-type="uri" xlink:href="http://arxiv.org/abs/1001.0943/" xlink:type="simple">http://arxiv.org/abs/1001.0943/</ext-link>. Accessed 2010 Nov 20</comment>          </element-citation></ref>
<ref id="pone.0014248-Good1"><label>27</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Good</surname><given-names>BH</given-names></name>
<name name-style="western"><surname>de Montjoye</surname><given-names>Y-A</given-names></name>
<name name-style="western"><surname>Clauset</surname><given-names>A</given-names></name>
</person-group>             <year>2010</year>             <article-title>The performance of modularity maximization in practical contexts.</article-title>             <source>Phys Rev E</source>             <volume>81 046106</volume>          </element-citation></ref>
<ref id="pone.0014248-Fortunato2"><label>28</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Fortunato</surname><given-names>S</given-names></name>
<name name-style="western"><surname>Barthelemy</surname><given-names>M</given-names></name>
</person-group>             <year>2006</year>             <article-title>Resolution limit in community detection,</article-title>             <source>Proceedings of the National Academy of Sciences</source>             <volume>104</volume>             <fpage>36</fpage>             <lpage>41</lpage>          </element-citation></ref>
<ref id="pone.0014248-Nielsen1"><label>29</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Nielsen</surname><given-names>T</given-names></name>
<name name-style="western"><surname>Hovgesen</surname><given-names>H</given-names></name>
</person-group>             <year>2008</year>             <article-title>Exploratory mapping of commuter flows in England and Wales,</article-title>             <source>Journal of Transport Geography</source>             <volume>16</volume>             <issue>2</issue>             <fpage>90</fpage>             <lpage>99</lpage>          </element-citation></ref>
<ref id="pone.0014248-Hall1"><label>30</label><element-citation publication-type="other" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Hall</surname><given-names>P</given-names></name>
<name name-style="western"><surname>Breheny</surname><given-names>M</given-names></name>
<name name-style="western"><surname>McQuaid</surname><given-names>R</given-names></name>
<name name-style="western"><surname>Hart</surname><given-names>D</given-names></name>
</person-group>             <year>1987</year>             <publisher-loc>Allen &amp; Unwin, London, Sydney and Wellington</publisher-loc>             <publisher-name>Western Sunrise: the genesis and growth of Britain's major high tech corridor</publisher-name>          </element-citation></ref>
<ref id="pone.0014248-UK1"><label>31</label><element-citation publication-type="journal" xlink:type="simple">             <article-title>UK statistics.</article-title>             <comment>Available: <ext-link ext-link-type="uri" xlink:href="http://www.neighbourhood.statistics.gov" xlink:type="simple">http://www.neighbourhood.statistics.gov</ext-link> and <ext-link ext-link-type="uri" xlink:href="http://www.statistics.gov.uk/" xlink:type="simple">http://www.statistics.gov.uk/</ext-link>. Accessed 2010 Nov 20</comment>          </element-citation></ref>
<ref id="pone.0014248-Trudgill1"><label>32</label><element-citation publication-type="other" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Trudgill</surname><given-names>P</given-names></name>
</person-group>             <year>1999</year>             <publisher-loc>Oxford</publisher-loc>             <publisher-name>The Dialects of England, Blackwell Publishers</publisher-name>          </element-citation></ref>
<ref id="pone.0014248-Onnela1"><label>33</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Onnela</surname><given-names>J-P</given-names></name>
<name name-style="western"><surname>Saramaki</surname><given-names>J</given-names></name>
<name name-style="western"><surname>Hyvonen</surname><given-names>J</given-names></name>
<name name-style="western"><surname>Szabo</surname><given-names>G</given-names></name>
<name name-style="western"><surname>Lazer</surname><given-names>D</given-names></name>
<etal/></person-group>             <year>2007</year>             <article-title>Structure and tie strengths in mobile communication networks.</article-title>             <source>Proceedings of the National Academy of Sciences</source>             <volume>104</volume>             <fpage>7332</fpage>             <lpage>7336</lpage>          </element-citation></ref>
<ref id="pone.0014248-Brockmann1"><label>34</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Brockmann</surname><given-names>D</given-names></name>
<name name-style="western"><surname>Hufnagel</surname><given-names>L</given-names></name>
<name name-style="western"><surname>Geisel</surname><given-names>T</given-names></name>
</person-group>             <year>2006</year>             <article-title>The scaling laws of human travel.</article-title>             <source>Nature</source>             <volume>439</volume>             <fpage>462</fpage>             <lpage>465</lpage>          </element-citation></ref>
<ref id="pone.0014248-Gonzalez1"><label>35</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Gonzalez</surname><given-names>MC</given-names></name>
<name name-style="western"><surname>Hidalgo</surname><given-names>CA</given-names></name>
<name name-style="western"><surname>Barabasi</surname><given-names>A-L</given-names></name>
</person-group>             <year>2008</year>             <article-title>Understanding individual human mobility patterns.</article-title>             <source>Nature</source>             <volume>453</volume>             <fpage>779</fpage>             <lpage>782</lpage>          </element-citation></ref>
<ref id="pone.0014248-Wang1"><label>36</label><element-citation publication-type="journal" xlink:type="simple">             <person-group person-group-type="author">
<name name-style="western"><surname>Wang</surname><given-names>P</given-names></name>
<name name-style="western"><surname>Gonzalez</surname><given-names>MC</given-names></name>
<name name-style="western"><surname>Hidalgo</surname><given-names>CA</given-names></name>
<name name-style="western"><surname>Barabasi</surname><given-names>A-L</given-names></name>
</person-group>             <year>2009</year>             <article-title>Understanding the spreading patterns of mobile phone viruses.</article-title>             <source>Science</source>             <volume>324</volume>             <fpage>1071</fpage>             <lpage>1076</lpage>          </element-citation></ref>
</ref-list>

</back>
</article>