{
    "version" : "https://jsonfeed.org/version/1",
    "content" : "guides",
    "type" : "single",
    "title" : "Understanding the Site Scanning program |Digital.gov",
    "description": "Understanding the Site Scanning program",
    "home_page_url" : "/preview/gsa/digitalgov.gov/bc-archive-content-3/","feed_url" : "/preview/gsa/digitalgov.gov/bc-archive-content-3/guides/site-scanning/index.json","item" : [
    {"title" :"Understanding the Site Scanning program","deck" : "A set of daily scans of the federal web presence.","summary" : "This program is available to automatically generate data about the health and best practices of federal websites.","date" : "2020-06-25T09:00:00-05:00","date_modified" : "2025-01-27T19:42:55-05:00","topics" : {
        
            "analytics" : "Analytics",
            "budgeting-and-performance" : "Budgeting and performance"
            },"primary_image" : { "uid" : "guide-site-scanning", "alt" :
  "A person works in front of a computer with many internet symbols on it", "width" :
  "1200", "height" :
  "630", "credit" :
  "agny_illustration/iStock via Getty Images", "caption" :
  "", "format" :
  "png" },"branch" : "bc-archive-content-3",
      "filename" :"_index.md",
      
      "filepath" :"guides/site-scanning/_index.md",
      "filepathURL" :"https://github.com/GSA/digitalgov.gov/blob/bc-archive-content-3/content/guides/site-scanning/_index.md",
      "editpathURL" :"https://github.com/GSA/digitalgov.gov/edit/bc-archive-content-3/content/guides/site-scanning/_index.md","url" : "/preview/gsa/digitalgov.gov/bc-archive-content-3/guides/site-scanning/","aliases" : {"0" : "/guide/site-scanning/","1" : "/site-scanning/","2" : "/sitescanning/","3" : "/site-scan/","4" : "/sitescan/","5" : "/site-scans/","6" : "/sitescans/"},"weight" : "2","content" :"\u003cp\u003e\u003cstrong\u003eThe Site Scanning program\u003c/strong\u003e automates a wide range of scans of public federal websites and generates data about website health, policy compliance, and best practices.\u003c/p\u003e\n\u003cp\u003eThe program is a shared service provided at no cost for federal agencies and the public to use. At its core is the Federal Website Index, a reference dataset listing all public federal .gov sites by agency/department. Daily scans generate over 1.5 million fields of data about 26,000 federal .gov websites, made publicly available via API and bulk download.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eWe scan federal domains for:\u003c/strong\u003e\u003c/p\u003e\n\u003cul\u003e\n\u003cli\u003eThe presence of agency websites and subdomains\u003c/li\u003e\n\u003cli\u003eDigital Analytics Program participation\u003c/li\u003e\n\u003cli\u003eUse of the US Web Design System\u003c/li\u003e\n\u003cli\u003eSearch engine optimization\u003c/li\u003e\n\u003cli\u003eThird party services\u003c/li\u003e\n\u003cli\u003eIPv6 compliance\u003c/li\u003e\n\u003cli\u003eOther best practices\u003c/li\u003e\n\u003c/ul\u003e\n\u003ch2 id=\"access-the-data-directly\"\u003eAccess the data directly\u003c/h2\u003e\n\u003cp\u003eAll scan data can be downloaded directly as a \u003ca href=\"data/\"\u003eCSV or JSON file\u003c/a\u003e or accessed through the \u003ca href=\"https://open.gsa.gov/api/site-scanning-api/\"\u003eSite Scanning API\u003c/a\u003e.\u003c/p\u003e\n\u003ch2 id=\"learn-more-about-the-program-the-scans-and-the-underlying-data\"\u003eLearn more about the program, the scans, and the underlying data\u003c/h2\u003e\n\u003cp\u003eMuch deeper program detail can be found in the program\u0026rsquo;s \u003ca href=\"https://github.com/gsa/site-scanning-documentation\"\u003edocumentation hub\u003c/a\u003e. The major sections include:\u003c/p\u003e\n\u003cul\u003e\n\u003cli\u003e\u003ca href=\"https://github.com/gsa/site-scanning-documentation#about\"\u003eAbout the program\u003c/a\u003e\u003c/li\u003e\n\u003cli\u003e\u003ca href=\"https://github.com/gsa/site-scanning-documentation#understanding-the-data\"\u003eUnderstanding the data\u003c/a\u003e\u003c/li\u003e\n\u003cli\u003e\u003ca href=\"https://github.com/gsa/site-scanning-documentation#program-management\"\u003eProgram management\u003c/a\u003e\u003c/li\u003e\n\u003c/ul\u003e\n\u003cp\u003eThe creation of the underlying website index is explained in the separate \u003ca href=\"https://github.com/GSA/federal-website-index\"\u003eFederal Website Index repository\u003c/a\u003e. It includes links to the original datasets, as well as descriptions of how they are assembled and filtered in order to create the list of URLs that are then scanned.\u003c/p\u003e\n\u003ch2 id=\"contact-the-site-scanning-team\"\u003eContact the Site Scanning team\u003c/h2\u003e\n\u003cp\u003e\u003cstrong\u003eQuestions?\u003c/strong\u003e Email the Site Scanning team at \u003ca href=\"mailto:site-scanning@gsa.gov\"\u003esite-scanning@gsa.gov\u003c/a\u003e.\u003c/p\u003e\n"}
  ]
}
