Can We Please Stop Saying "Unstructured" Data?

Grant Ingersoll (LucidWorks), in an article on GigaOm, asks for a term other than “unstructured” data to characterize natural form text:

Text is easily one of the most highly structured data types we face, filled with misspellings, misdirection, flowery language, ambiguity and implicit knowledge. Text is so often misunderstood that researchers in the field even have a metric (inter-annotator agreement) that tracks how often two people examining the same piece of text agree on the answer to some question on the text.

Some random thoughts:

    Unstructured data doesn’t refer only to natural form text. Speaking of text, from the 4 years of (Romanian) grammar I studied in school, the thing I remember best is the countless exceptions. To me that sounds like a lack of structure. I’m pretty sure there are analysts out there who have come up with different terms, but sometimes having everyone understand the meaning of a term is more important than the term itself.

Original title and link: Can We Please Stop Saying “Unstructured” Data? (NoSQL database©myNoSQL)
