This software allows users to extract data from a variety of sources for automated data mining and conversion to facilitate more efficient extraction of relevant data into relational databases. Within the past few decades, the use of digital data collection and storage by corporations, governments and other entities has increased exponentially. This data is usually converted into a standard structured data format, such as XML, JSON, or others. Because the volume of data generated is far greater than can be manually analyzed, various XML shredding processes attempt to parse XML and structured data into relational databases, to facilitate easier data anaylsis, reporting and graphics generation. Unfortunately, the complexities and inconsistencies within XML or structured data objects results in wasted database memory and inefficient access to the relevant data. To correct this problem, researchers at the University of Florida have developed software to facilitate user-directed database configuration and automated mining and conversion of structured data.
Software to facilitate quick and efficient data mining and conversion
This package, consisting of a software program and method and allows a user to provide the desired object types and list of associated fields of interest occurring in a data source, a set of source files or structured data (such as XML files). The software retrieves the data and generates primary keys along with the relational tables containing the compiled fields.