Term | Definition |
petabyte | 10^15 bytes |
exabyte | 10^18 bytes |
business intelligence (BI) | information that results from the processing of operational data to create information that exposes patterns, relationships, and trends of importance to the organization |
business intelligence (BI) system | a system that provides the right information, to the right user, at the right time. A tool produces the info but a system makes it right |
reporting systems | a system that creates information from disparate sources and delivers that information to the proper users on a timely basis |
data mining systems | IS that processes data using sophisticated statistical techniques to find patterns and relationships that cannot be found by simpler operations |
market-based analysis | data mining technique for determining sales patterns. shows the products that customers tend to buy together |
knowledge management systems | IS for storing and retrieving organizational knowledge, whether that knowledge is in the form of data, documents, or employee know-how |
Expert systems | knowledge-sharing system that is created by interviewing experts in a given business domain and codifying the rules used by those experts |
dirty data | problematic data |
granularity | level of detail in data |
clickstream data | e-commerce data that describes a customer's clicking behavior. Includes everything the customer does at the Web site |
curse of dimensionality | the more attributes there are, the easier it is to build a data model that fits the sample data but is useless as a predictor |
data mining | the application of statistical techniques to find patterns and relationships among data |
unsupervised data mining | data mining whereby the analysts do not create a model or hypothesis before running the analysis. Create hypothesis after results are found |
cluster analysis | unsupervised data mining technique whereby statistical techniques are used to ID groups of entities with similar characteristics |
supervised data mining | data mining where miners develop a model prior to the analysis and statistical techniques to data to estimate values of the parameters of the model |
regression analysis | linear estimator |
neural networks | supervised data mining technique used to predict values and make classifications |