Bill Inmon takes a general, non technical look at the problem of dealing with textual data. It remains a more difficult form of data to deal with. He introduces the use of ETL to convert text to relational databases. His introduction:
" .. It is stated that in most corporations 80% of the data is textual. In some corporations – insurance companies, for one – that ratio may be low. And for a long time it has been held that textual data cannot be manipulated by a computer. Textual data is notoriously non repetitive, and non-repetitive data simply does not fit well into a standard database management system. Database management systems are built for data with a structure that repeats itself over and over .. "
Wednesday, February 08, 2012
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment