{
    "version" : "https://jsonfeed.org/version/1",
    "content" : "news",
    "type" : "single",
    "title" : "Caution: Your Web Analytics Might Not Be Human |Digital.gov",
    "description": "Caution: Your Web Analytics Might Not Be Human",
    "home_page_url" : "/preview/gsa/digitalgov.gov/replace-hugo-links-4-migration-archive/","feed_url" : "/preview/gsa/digitalgov.gov/replace-hugo-links-4-migration-archive/2015/09/09/caution-your-web-analytics-might-not-be-human/index.json","item" : [
    {"title" :"Caution: Your Web Analytics Might Not Be Human","summary" : "A Digital Analytics Program (DAP) user recently contacted me with an observation/problem: The data he had from his website’s independent Web-analytics account was much, much higher than the data he was receiving in the DAP user interface. Theoretically, both tools (in this case, two separate Google Analytics accounts), were trying to measure the same thing,","date" : "2015-09-09T10:00:14-04:00","date_modified" : "2025-02-14T09:43:36-05:00","authors" : {"tlowden" : "Tim Lowden"},"topics" : {
        
            "analytics" : "Analytics",
            "open-data" : "Open data"
            },"branch" : "replace-hugo-links-4-migration-archive",
      "filename" :"2015-09-09-caution-your-web-analytics-might-not-be-human.md",
      
      "filepath" :"news/2015/09/2015-09-09-caution-your-web-analytics-might-not-be-human.md",
      "filepathURL" :"https://github.com/GSA/digitalgov.gov/blob/replace-hugo-links-4-migration-archive/content/news/2015/09/2015-09-09-caution-your-web-analytics-might-not-be-human.md",
      "editpathURL" :"https://github.com/GSA/digitalgov.gov/edit/replace-hugo-links-4-migration-archive/content/news/2015/09/2015-09-09-caution-your-web-analytics-might-not-be-human.md","slug" : "caution-your-web-analytics-might-not-be-human","url" : "/preview/gsa/digitalgov.gov/replace-hugo-links-4-migration-archive/2015/09/09/caution-your-web-analytics-might-not-be-human/","content" :"\u003cp\u003eA \u003ca href=\"/preview/gsa/digitalgov.gov/replace-hugo-links-4-migration-archive/guides/dap/\" target=\"_blank\"\u003eDigital Analytics Program (DAP)\u003c/a\u003e user recently contacted me with an observation/problem: The data he had from his website’s independent Web-analytics account was much, much higher than the data he was receiving in the DAP user interface. Theoretically, both tools (in this case, two separate Google Analytics accounts), were trying to measure the same thing, and he couldn’t figure out why the numbers would be so different.\u003c/p\u003e\n\u003cp\u003eWhen I say different, I mean substantially so. Looking at the pageviews metric, the agency implementation was reporting almost 33% MORE views than DAP. Naturally, he hoped that the higher numbers were the “correct” ones, and somehow, the DAP numbers were incorrect.\u003c/p\u003e\n\u003cp\u003eThe first thing I told him was that, unfortunately, the two numbers will never be \u003ca href=\"http://fivethirtyeight.com/features/why-we-still-cant-agree-on-web-metrics/\" target=\"_blank\"\u003eexactly the same\u003c/a\u003e. Tracking with two different tools or in this case, even two instances of the same tool, won’t end up reporting perfect matches (since the DAP code is custom-built). That said, a 33% delta was far too much; and after some thought, I figured out what the main problem was. \u003cdiv class=\"image\"\u003e\n  \u003cimg\n    src=\"https://s3.amazonaws.com/digitalgov/_legacy-img/2015/09/600-x-400-Robot-Spider-with-clipping-path-Linda-Bucklin-iStock-Thinkstock-139863441.jpg\"\n    alt=\"Illustration of a robot spider\"/\u003e\u003c/div\u003e\n\n\u003c/p\u003e\n\u003cp\u003eSpiders.\u003c/p\u003e\n\u003cp\u003eOk, not just spiders, but spiders and robots—of the digital kind. \u003ca href=\"http://en.wikipedia.org/wiki/Internet_bot\" target=\"_blank\"\u003eInternet bots\u003c/a\u003e and \u003ca href=\"http://en.wikipedia.org/wiki/Web_crawler\" target=\"_blank\"\u003eWeb crawlers \u003c/a\u003e(better known as spiders) \u003ca href=\"https://www.incapsula.com/blog/bot-traffic-report-2014.html\" target=\"_blank\"\u003ecan account for a lot of traffic\u003c/a\u003e in the digital universe. These little digital “creatures” are software applications that run automated tasks on the Internet and record data much faster than a human. There are “good” and “bad” versions of them. For example, “good” spiders are often used to index data for updating content or for search engine use. Unfortunately, these automated tasks can be reported as visits to your pages.\u003c/p\u003e\n\u003cp\u003eIn the summer of 2014, \u003ca href=\"https://plus.google.com/+GoogleAnalytics/posts/2tJ79CkfnZk\" target=\"_blank\"\u003eGoogle announced\u003c/a\u003e that it had added a new feature to Google Analytics that filters out bots and spiders based on a \u003ca href=\"http://www.iab.net/1418/spiders\" target=\"_blank\"\u003econstantly updated list\u003c/a\u003e that usually costs thousands of dollars to access. In autumn 2014, the DAP staff chose to implement the filter, which can be done in a click.\u003c/p\u003e\n\u003cdiv class=\"image\"\u003e\n  \u003cimg\n    src=\"https://s3.amazonaws.com/digitalgov/_legacy-img/2015/09/328-x-74-Bot-Filter.jpg\"\n    alt=\"Screen capture of a bot filter\"/\u003e\u003c/div\u003e\n\n\n\u003cp\u003eSo I asked the DAP user if this option was enabled in his independent implementation (by default in Google Analytics, it is NOT turned on), and he responded that it was not, but that he’d turn it on to give it a shot.\u003c/p\u003e\n\u003cp\u003eA few days later, we revisited the data. It turns out, bots and spiders represented a significant portion of the pageviews the independent account was recording, especially to the homepage. By using this feature, we effectively changed the delta from 33% to a single-digit percentage.\u003c/p\u003e\n\u003cp\u003eIf your agency is participating in the DAP and also running independent analytics tools, we encourage you to examine your data and compare. If you are not participating, we recommend you check with your analytics provider to see if it has a similar feature, or ask how bot and spider traffic can be accounted for.\u003c/p\u003e\n\u003cp\u003eHigh numbers are great, but human numbers are better.\u003c/p\u003e\n"}
  ]
}
