User-Configured Data Mining and Conversion Software

Allows Users to Selectively Compile Data from Various Fields of Interest and Generate Output Files for Analyses, Reports, Graphics and More

This software lets users extract data from a variety of sources through automated data mining and conversion, making it more efficient to load relevant data into relational databases. Within the past few decades, the use of digital data collection and storage by corporations, governments and other entities has increased exponentially. This data is usually converted into a standard structured data format, such as XML or JSON. Because the volume of data generated is far greater than can be analyzed manually, various XML shredding processes attempt to parse XML and other structured data into relational databases to make data analysis, reporting and graphics generation easier. Unfortunately, the complexities and inconsistencies within XML or structured data objects result in wasted database memory and inefficient access to the relevant data.

To correct this problem, researchers at the University of Florida have developed software to facilitate user-directed database configuration and automated mining and conversion of structured data.

Application

Software to facilitate quick and efficient data mining and conversion

Advantages

  • Prevents creation of unnecessary and underused database tables, conserving memory and processing resources
  • Allows users to specify the desired data object types, facilitating efficient access
  • Makes searches more accurate and precise, saving time and money
  • Creates integral indexed tables, making powerful and efficient SQL analyses and presentations possible

Technology

This package, consisting of a software program and method, allows a user to provide the desired object types and a list of associated fields of interest occurring in a data source, such as a set of source files or structured data (for example, XML files). The software retrieves the data and generates primary keys along with the relational tables containing the compiled fields.
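
The workflow described above can be illustrated with a minimal Python sketch. It is not the University of Florida software; it only assumes one plausible reading of the description: a user-supplied mapping of object types to fields of interest, XML as the source format, and SQLite as the relational target. The sample data, the CONFIG mapping and the build_tables function are all hypothetical names introduced for illustration.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical sample input; element and field names are illustrative only.
SAMPLE_XML = """
<records>
  <patient><name>Ann</name><age>41</age><notes>n/a</notes></patient>
  <patient><name>Bob</name><age>57</age></patient>
  <visit><date>2020-01-05</date><clinic>North</clinic></visit>
</records>
"""

# Assumed user configuration: object type -> fields of interest.
# Fields not listed here (notes, clinic) are never given a column.
CONFIG = {"patient": ["name", "age"], "visit": ["date"]}


def build_tables(xml_text, config, conn):
    """Create one table per configured object type and load matching elements."""
    root = ET.fromstring(xml_text)
    for object_type, fields in config.items():
        # Names come from trusted configuration in this sketch; a real tool
        # would validate them before building SQL identifiers.
        column_defs = ", ".join(f'"{f}" TEXT' for f in fields)
        conn.execute(
            f'CREATE TABLE "{object_type}" '
            f"(id INTEGER PRIMARY KEY AUTOINCREMENT, {column_defs})"
        )
        column_names = ", ".join(f'"{f}"' for f in fields)
        placeholders = ", ".join("?" for _ in fields)
        for element in root.iter(object_type):
            # findtext returns None for a missing field, which is stored as NULL.
            values = [element.findtext(f) for f in fields]
            conn.execute(
                f'INSERT INTO "{object_type}" ({column_names}) '
                f"VALUES ({placeholders})",
                values,
            )
    conn.commit()


conn = sqlite3.connect(":memory:")
build_tables(SAMPLE_XML, CONFIG, conn)
rows = conn.execute("SELECT id, name, age FROM patient ORDER BY id").fetchall()
print(rows)
```

Because only the configured object types and fields become tables and columns, unrequested elements in the source never consume database space, which is the resource saving the Advantages list refers to. The generated id column stands in for the primary keys mentioned above.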

Patent Information: