TOWARDS EFFICIENT DATA ANALYSIS AND MANAGEMENT OF SEMI-STRUCTURED DATA
Author: Shirish Tatikonda
Publisher: Ohio State University
Publication date: 2010
Number of pages: 218
Format / Quality: pdf
Over the last decade, there has been an enormous growth in both the amount and the complexity of online content that is collected and processed by humans and machines. Such a growth has spurred interest in exible and uid (semi-structured) data models that do not constrain the data to follow a _xed schema. Many applications ranging from bioinformatics to XML repositories, from software engineering to computational linguistics, are now generating and processing large amounts of semi-structured data. For these applications to reach their full potential, we need to build an e_ective set of tools to index, process, manage, and analyze such data.