Application Database Management System

Not all of this data is required on a regular basis, and whatever is not needed should be filtered out of the query tables. It is not always necessary to mine the contents of an entire table to identify useful information. Potential sources of data that may be useful in a data mining application include:

  • Census data
  • Sales records
  • Mailing lists
  • Demographic databases
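
As an illustration of filtering unneeded data out of the query tables, the following minimal sketch copies only the required columns and rows from a source table into a smaller query table. It assumes an in-memory SQLite database; the table names (sales_records, sales_query), the column names, and the helper build_query_table are hypothetical and chosen only for the example.

```python
import sqlite3


def build_query_table(conn, source, target, columns, where):
    """Copy only the needed columns and rows from `source` into `target`.

    The table and column names are interpolated directly into the SQL,
    so this sketch assumes they come from trusted code, not user input.
    """
    col_list = ", ".join(columns)
    conn.execute(
        f"CREATE TABLE {target} AS SELECT {col_list} FROM {source} WHERE {where}"
    )
    # Report how many rows survived the filter.
    return conn.execute(f"SELECT COUNT(*) FROM {target}").fetchone()[0]


# Hypothetical source table standing in for a sales-records feed.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sales_records ("
    "customer_id INTEGER, region TEXT, amount REAL, "
    "sale_year INTEGER, clerk_notes TEXT)"
)
conn.executemany(
    "INSERT INTO sales_records VALUES (?, ?, ?, ?, ?)",
    [
        (1, "North", 120.0, 2021, "gift wrap"),
        (2, "South", 75.5, 2019, ""),
        (3, "North", 310.0, 2022, "repeat buyer"),
    ],
)

# Keep three columns and only the recent rows; free-text notes are left behind.
kept = build_query_table(
    conn,
    source="sales_records",
    target="sales_query",
    columns=["customer_id", "region", "amount"],
    where="sale_year >= 2021",
)
query_columns = [
    d[0] for d in conn.execute("SELECT * FROM sales_query").description
]
```

Mining then runs against the smaller sales_query table rather than the full source table.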

Data Problems

‘Dirty’ or inaccurate data in the mining data store must be avoided if results are to be accurate and useful. Many data mining tools include a system log or a graphical interface tool to identify erroneous data in queries, but every effort should be made before this stage to ensure that such data never reaches the mining database.
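
One way to keep erroneous records out of the mining database is to validate them before loading. The sketch below is only an illustration under assumed rules: the field names (customer_id, postcode, age), the age range, and the helper split_clean_and_dirty are hypothetical and not taken from any particular data mining tool.

```python
def split_clean_and_dirty(records, required, checks):
    """Separate records that pass validation from those that do not.

    `required` lists fields that must be present and non-empty.
    `checks` maps a field name to a function returning True for valid values.
    Rejected records are returned together with the reasons for rejection.
    """
    clean, rejected = [], []
    for rec in records:
        problems = [
            f"missing {field}"
            for field in required
            if rec.get(field) in (None, "")
        ]
        for field, is_valid in checks.items():
            value = rec.get(field)
            # Only range-check values that are actually present.
            if value not in (None, "") and not is_valid(value):
                problems.append(f"invalid {field}")
        if problems:
            rejected.append((rec, problems))
        else:
            clean.append(rec)
    return clean, rejected


# Hypothetical incoming records from a mailing list.
incoming = [
    {"customer_id": 1, "postcode": "AB1 2CD", "age": 34},
    {"customer_id": 2, "postcode": "", "age": 41},
    {"customer_id": 3, "postcode": "ZZ9 9ZZ", "age": 212},
]

clean, rejected = split_clean_and_dirty(
    incoming,
    required=["customer_id", "postcode"],
    checks={"age": lambda a: 0 <= a <= 120},  # assumed plausible age range
)
```

Only the records in clean would be loaded into the mining data store; the rejected list, with its reasons, can be sent back to the source system for correction.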

Discovery-driven Systems

In a data mining environment, the data warehouse, query generators, and data interpretation components are combined with discovery-driven systems to automatically reveal important yet hidden data. The following tasks need to be completed to make full use of data mining: