{
    "version" : "https://jsonfeed.org/version/1",
    "content" : "news",
    "type" : "single",
    "title" : "The Data Briefing: Tales from the Dark Side of Data | Digital.gov",
    "description": "The Data Briefing: Tales from the Dark Side of Data",
    "home_page_url" : "/preview/gsa/digitalgov.gov/bc-archive-content-3/","feed_url" : "/preview/gsa/digitalgov.gov/bc-archive-content-3/2016/04/27/the-data-briefing-tales-from-the-dark-side-of-data/index.json","item" : [
    {"title" :"The Data Briefing: Tales from the Dark Side of Data","summary" : "There are many scary tales in the world of knowledge management and data management. Tales of missing data that was lost through the administrative cracks, such as the story of the missing Apollo 11 moonwalk tapes that most likely were erased by accident. Or the 36-year search for the original Wright Brothers’ patent, which was","date" : "2016-04-27T10:00:50-04:00","date_modified" : "2025-01-27T19:42:55-05:00","authors" : {"bbrantley" : "Bill Brantley"},"topics" : {
        
            "emerging-tech" : "Emerging tech",
            "open-data" : "Open data"
            },"branch" : "bc-archive-content-3",
      "filename" :"2016-04-27-the-data-briefing-tales-from-the-dark-side-of-data.md",
      
      "filepath" :"news/2016/04/2016-04-27-the-data-briefing-tales-from-the-dark-side-of-data.md",
      "filepathURL" :"https://github.com/GSA/digitalgov.gov/blob/bc-archive-content-3/content/news/2016/04/2016-04-27-the-data-briefing-tales-from-the-dark-side-of-data.md",
      "editpathURL" :"https://github.com/GSA/digitalgov.gov/edit/bc-archive-content-3/content/news/2016/04/2016-04-27-the-data-briefing-tales-from-the-dark-side-of-data.md","slug" : "the-data-briefing-tales-from-the-dark-side-of-data","url" : "/preview/gsa/digitalgov.gov/bc-archive-content-3/2016/04/27/the-data-briefing-tales-from-the-dark-side-of-data/","content" :"\u003cp\u003eThere are many scary tales in the world of knowledge management and data management. Tales of missing data that was lost through the administrative cracks, such as the story of the \u003ca href=\"http://www.npr.org/2009/07/16/106637066/houston-we-erased-the-apollo-11-tapes\" target=\"_blank\"\u003emissing Apollo 11 moonwalk tapes\u003c/a\u003e that most likely were erased by accident. Or the \u003ca href=\"https://www.washingtonpost.com/local/lost-plans-for-wright-brothers-flying-machine-found-after-36-years/2016/04/02/e526fd56-f6b2-11e5-9804-537defcc3cf6_story.html\" target=\"_blank\"\u003e36-year search for the original Wright Brothers’ patent\u003c/a\u003e, which was happily re-discovered this month. As more data is being created at ever-increasing speed and complexity, there will be more missing data horror stories. \u003cdiv class=\"image\"\u003e\n  \u003cimg\n    src=\"https://s3.amazonaws.com/digitalgov/_legacy-img/2016/04/600-x-400-Businesswoman-with-magnifier-glass-iStock-Thinkstock-178453035.jpg\"\n    alt=\"Businesswoman searches for data with magnifying glass.\"/\u003e\u003c/div\u003e\n\n\u003c/p\u003e\n\u003cp\u003eData is easier to create now than ever before in history. The government has always been a major creator and collector of data. Whenever I think of government data, I think of the enormous government warehouse at the end of Raiders of the Lost Ark. That warehouse has grown even larger as more and more data has been lost in the endless rows of storage. 
I wrote about the \u003ca href=\"/preview/gsa/digitalgov.gov/bc-archive-content-3/2015/06/17/the-api-briefing-the-challenge-of-governments-dark-data/\" target=\"_blank\"\u003eproblem of “dark data” in a column last year\u003c/a\u003e:\u003c/p\u003e\n\u003cp\u003e“When different parts of the organization are not collaborating on data collection or data analysis, they could also create dark data. Also, the organization may not have the tools or the analytical skills to analyze the collected data. Finally, dark data may happen because the organization does not have good documentation on the organizational datasets. Dark data is like an attic or storage shed, where boxes of information are stored away with the promise that we will, one day, get around to working with the data.”\u003c/p\u003e\n\u003cp\u003eDark data, if kept for too long, can become toxic data. At a panel for a recent government-wide event, military officials discussed the \u003ca href=\"https://fcw.com/articles/2016/04/15/toxic-data-lyngaas.aspx\" target=\"_blank\"\u003edangers of storing data past its useful life\u003c/a\u003e: “Datasets created and stored before the development of advanced cybersecurity protections can potentially offer easy pathways for hackers.” Systems to store and utilize the data have to be maintained well past their useful life, which also increases the risk of hackers penetrating other, more modern systems through well-known vulnerabilities in the older systems. Like chains, which are only as strong as their weakest links, computer networks are only as secure as their most vulnerable network component.\u003c/p\u003e\n\u003cp\u003eI’ve been on several projects where a data solution was implemented because of an immediate need. Some data modeling and analysis was performed, but not enough long-range planning was done to help future-proof the datasets. 
Thus, you have data locked into old systems where the limitations of the systems prevent the office from adopting newer, more effective systems. In one project, we just had to cut our losses because it was impossible to migrate the data from the old system into the new database. Therefore, the past data is locked away with very little hope of making it accessible again.\u003c/p\u003e\n\u003cp\u003eSometimes, data science reads like an Edgar Allan Poe story. Datasets are locked away in a forgotten (computer) prison or trapped behind a (data silo) brick wall. Datasets grow toxic over time and turn into menaces to network security.\u003c/p\u003e\n\u003cp\u003eData has to be managed, stored and used carefully and effectively. The ability to create and collect data has become a blessing to the government as it helps us to make better-informed decisions and is a \u003ca href=\"/preview/gsa/digitalgov.gov/bc-archive-content-3/2015/04/15/the-api-briefing-how-essential-is-government-data-to-the-american-economy/\"\u003emajor component of today’s global economy\u003c/a\u003e. However, data can also be a curse when decisions are based on expired or lost data. The key is to have clear objectives for collecting and using the data while having a management plan for the lifecycle of the data. Data horror stories are not good for government or the American public.\u003c/p\u003e\n\u003cp\u003e\u003cem\u003eEach week, \u003ca href=\"/preview/gsa/digitalgov.gov/bc-archive-content-3/topics/emerging-tech/\"\u003eThe Data Briefing\u003c/a\u003e showcases the latest federal data news and trends. Dr. William Brantley is the Training Administrator for the U.S. Patent and Trademark Office’s Global Intellectual Property Academy. You can find out more about his personal work in open data, analytics, and related topics at \u003ca href=\"http://billbrantley.com/\"\u003eBillBrantley.com\u003c/a\u003e. 
All opinions are his own and do not reflect the opinions of the USPTO or GSA.\u003c/em\u003e\u003c/p\u003e\n"}
  ]
}
