click below
click below
Normal Size Small Size show me how
Stack #4410646
| Question | Answer |
|---|---|
| A data warehouse architecture where metadata, summary data, and raw data are stored within the central repository of the warehouse. | SIMPLE |
| He was considered as the father of Data Warehouse. | INMON BILL |
| A data warehouse architecture where operational data must be cleaned and processed before being put in the warehouse. | SIMPLE WITH STAGING AREA |
| A data warehouse in Banking, Telecommunication, and Financial Services are examples of ________. | APPLICATIONS OF DATA WAREHOUSE |
| A specific type of database that represents data from multiple dimensions. | OLAP CUBE |
| The _____ table is a collection of reference information about a measurable in the fact table. | DIMENSION |
| The HOLAP system is a blend of _____ and MOLAP. | ROLAP |
| An OLAP system that is created to facilitate management of both spatial and non-spatial data in a Geographic Information System. | SOLAP |
| The type of OLAP system that works on the information that resides in a relational database. | ROLAP |
| An OLAP system that utilizes a multi-dimensional database for storing and analyzing information. | MOLAP |
| The full form of DOLAP acronym is ______ OLAP. | DESKTOP |
| OLAP is located in between Front-end tools and ___________. | DATA WAREHOUSE |
| ETL (Extract, Transform, Load) is an acronym for: __, ___, __. | EXTRACT, TRANSFORM, LOAD |
| The ETL tool transforms data in the staging area before EDW. | TRUE |
| ETL loading type that applies ongoing changes as when needed periodically. | INCREMENTAL LOAD |
| An ETL loading type that populates all the data warehouse tables. | INITIAL LOAD |
| An ETL loading type that erases the contents of one or more tables and reloads with fresh data. | FULL REFRESH LOAD |
| This strategy is also known as delta, where only the data being changed is extracted and updates data warehouses. | PARTIAL EXTRACTION (WITH UPDATE NOTIFICATION) |
| An OLAP operation that performs the analysis by taking one level of information for display. | SLICE |
| An OLAP operation that performs analysis in deeper dimensions of data. | DRILL-DOWN |
| An OLAP operation that is also known as consolidation, used to summarize operational data along with the dimension. | DRILL-UP |
| The ____ operation performs the analysis that can gain a new view of data by rotating the data axes of the cube. | PIVOT |
| Data Mining & Machine Learning: The ______ is about processing data and identifying patterns and trends in that information so that you can decide or judge. | DATA MINING |
| A DM algorithm that constructs a classifier in the form of a decision tree. | C4.5 |
| A DM algorithm that does not require a predefined set of outputs but rather looks for patterns or trends without any label or target. | UNSUPERVISED LEARNING |
| A DM algorithm that requires a label or target. | SUPERVISED LEARNING |
| A DM algorithm that has an assumption: Every feature of the data being classified is independent of all other features given the class. | NAÏVE BAYES |
| A data mining technique that is used to determine when something is noticeably different from the regular pattern. | ANOMALY DETECTION |
| A data mining technique that is used to make predictions based on relationships within the dataset. | REGRESSION |
| A type of data mining algorithm that is used to mine data and provide the latest information on past or recent events. | DESCRIPTIVE ANALYSIS |
| Data mining is applied in _______ websites to offer cross-sells and up-sells through their websites. | E-COMMERCE |
| KDD stands for: | KNOWLEDGE DISCOVERY IN DATABASES |
| Machine Learning & Classification: Artificial Intelligence refers to the algorithms that can learn from data to make predictions. | FALSE (MACHINE LEARNING) |
| Machine Learning refers to the algorithms that can learn from data to make predictions. | TRUE |
| Classification technique is a supervised learning. | TRUE |
| Classification technique is an unsupervised learning. | FALSE |
| Classification technique is a(n) _______ learning. | SUPERVISED |
| Decision Tree is an example of an association algorithm. | FALSE |
| Decision Tree is an example of a classification algorithm. | TRUE |
| Decision Tree is an example of a(n) _______ algorithm. | CLASSIFICATION |
| Clustering technique is a supervised learning. | FALSE |
| Clustering technique is an unsupervised learning. | TRUE |
| Clustering technique is a(n) _______ learning. | UNSUPERVISED |
| CRISP-DM (Cross-Industry Standard Process for Data Mining): What is the 4th step in CRISP-DM? | MODELING |
| The CRISP-DM phase that sets the initial data collection and proceeds with activities in order to get familiar with the data. | DATA UNDERSTANDING |
| The phase of CRISP-DM that consists of presenting the results in a useful and understandable manner, and by achieving this, the project should achieve its goals. | DEPLOYMENT |
| The EVALUATION phase of CRISP-DM that consists of presenting the results in a useful and understandable manner, and by achieving this, the project should achieve its goals. | FALSE (DEPLOYMENT) |
| The DEPLOYMENT phase of CRISP-DM that consists of presenting the results in a useful and understandable manner, and by achieving this, the project should achieve its goals. | TRUE |
| Unsupervised learning is a predictive analysis. | FALSE (DESCRIPTIVE ANALYSIS) |
| The predictive analysis provides answers to future queries that move across using historical data as the chief principle for decisions. | TRUE |
| The descriptive analysis provides answers to future queries that move across using historical data as the chief principle for decisions. | FALSE (PREDICTIVE ANALYSIS) |
| The _____ analysis provides answers to future queries that move across using historical data as the chief principle for decisions. | PREDICTIVE |
| It refers to the numeric study of data relationships. | STATISTICS |
| In the 9-step KDD process, choosing a data mining task is the same as choosing a data mining algorithm. | FALSE |
| The Naïve Bayes algorithm is a _______ data mining. | CLASSIFICATION |
| A data warehouse architecture where metadata, summary data, and raw data are stored within the central repository of the warehouse. | SIMPLE |
| He was considered as the father of Data Warehouse. | INMON BILL |
| Raising ownership, dimensional technique, and extra reporting are some ____ of data warehouse. | DISADVANTAGES |
| Data Warehousing started in the late 1980s when Paul ___ and Barry Devlin developed the Business Data Warehouse. | MURPHY |
| An EDW architecture with data mart level is a three-tier architecture. | False |
| The acronym OLAP stands for: Online Analysis Processing. | False (Online Analytical Processing Server) |
| The Data Mart is a type of data warehouse that is refreshed in real time. | False (Data warehouse) |
| The Designing step of Data Mart implementation involves creating the physical database and the logical structures. | False (Constructing) |
| Type of data warehouse which is refreshed in real time. | ODS |
| A ____ data mart that can take data from data warehouses or operational systems. | HYBRID |
| A _____ data mart allows sourcing organization's data from a single data warehouse. | DEPENDENT |
| ODS uses _____ systems to manage dynamic data in real-time. | OLTP |
| An EDW with a database directly connected with the analytical interfaces where the end user can make queries. | One-tier |
| A ______ is a logical description that describes the entire database. | SCHEMA |
| The Star Schema is also known as _____. | STAR JOIN SCHEMA |
| The ____ Schema is also known as STAR JOIN SCHEMA. | STAR |
| The fact table contains the primary key column that allows joins with dimension tables. | True |
| The fact table measures those that contain numeric facts. | True |
| The Fact table measures is also known as fact data. | True |
| Data warehouse architecture with data marts. | HUB AND SPOKE? |
| The load manager is also known as _____ component. | FRONT |
| Data Warehousing started in the late 1980s when Paul Murphy and _________ developed the Business Data Warehouse. | Barry Devlin |
| The acronym OLAP stands for: Online Analytical Processing. | True |
| The 'T' in the acronym OLTP stands for: TRANSFORM. | False |
| The ETL tool transforms data in the staging area before EDW. | True |
| An EDW with data mart level is a three-tier architecture. | False (Two-tier) |
| An EDW architecture with OLAP. | Three-Tier |
| An EDW with a database directly connected with the analytical interfaces where the end user can make queries. | One-tier |
| The ____ tool transforms data in the staging area before EDW. | ETL |
| A _____ data mart is created without the use of a central data warehouse. | INDEPENDENT |
| Data Warehouse uses ______ systems to organize and present information in specific formats to accommodate the diverse needs of various users. | OLAP??? |
| In ____ schema, the dimension tables are normalized, which splits data into additional tables. | SNOWFLAKE |
| An extension of star schema where the dimension tables are connected to one or more dimensions. | SNOWFLAKE SCHEMA |
| These tables hold fields that represent the direct facts, as well as the foreign fields that are used to connect the fact table with other dimension tables in the Data Warehouse system. | FACT TABLE |
| Galaxy schema is also known as FACT CONSTELLATION SCHEMA. | True |
| The Star schema can have one fact table and a number of associated dimension tables. | True |
| Client-side front-end tools are in ____ tier. | TOP |
| Alternative name for Data Warehouse System is EIS. What is the full form of this acronym? | Executive Information System |
| Data warehouse in Banking, Telecommunication, and Financial Services are examples of ________. | Applications of Data Warehouse |
| The 'T' in the acronym OLTP stands for: TRANSACTIONS. | True |
| An EDW with data mart level is a two-tier architecture. | True |
| ODS support only daily operations, so their view of historical data is very limited. | True |
| What is the full form of the acronym OLTP? | Online Transactions Processing Databases |
| The 'T' in the acronym OLTP stands for: _________. | TRANSACTIONS |
| Data Mart implementation step that involves putting the data to use: querying the data, creating reports, charts, and publishing them. | Accessing |
| The ____ table contains the foreign key column that allows joins with dimension tables. | FACT |
| The Galaxy schema can have one fact table and a number of associated dimension tables. | False (STAR SCHEMA) |
| The Star Schema is also known as JOINT SCHEMA. | False (STAR JOIN SCHEMA) |
| The _____ is also called the front component. | LOAD MANAGER |
| The middle tier consists of the _____ servers. | OLAP |
| Data mining tool is an example of: | END-USER ACCESS TOOL |
| Enhanced Business Intelligence, query process, and timely access to data are examples of ______. | Advantages of data warehouse |
| An electronic storage of a large amount of information by a business or organization. | DATA WAREHOUSE |
| A _____ is a specific type of database that represents data from multiple dimensions. | OLAP CUBE |
| The _____ table is a collection of reference information about a measurable in the fact table. | DIMENSION |
| In Star Schema, every dimension in a star schema is represented with only one-dimension table. | True |
| A type of Data Management System that is solely intended to perform queries and analysis from large amounts of historical data to support BI activities. | DATA WAREHOUSE |
| The ___ tier mainly consists of the Data Sources, ETL Tool, and Data Warehouse. | BOTTOM |
| OLAP Server is in _____ tier. | MIDDLE |
| Data mart has a more implementation time compared to data warehouse systems. | False |
| An EDW with OLAP level is a two-tier architecture. | False |
| ODS support only ____ operations, so their view of historical data is very limited. | DAILY |
| An EDW architecture with data mart level. | Two-Tier |
| The Fact table consists of the key column and measures or ____. | FACT DATA |
| A data warehouse architecture where operational data must be cleaned and processed before being put in the warehouse. | SIMPLE WITH STAGING AREA |
| The QUERY MANAGER is also known as the _____ component | BACKEND |
| What is the full form of the acronym BIS? | Business Intelligence Solution |
| The acronym OLAP stands for: Online Analysis Processing. | False |
| An EDW component that refers to the tools that give end users access to data. | Reporting Layer |
| The Star Schema is also known as _______. | Star Join Schema |
| A ______ is a data structure that allows fast analysis of data according to the multiple dimensions that define a business problem. | OLAP CUBE |
| The fact table contains the foreign key column that allows joins with ________ tables. | Dimension |
| A Data Warehouse system can have one or more fact tables. | True |
| The Designing step of Data Mart implementation involves creating the physical database and the logical structures. | False |
| In Galaxy schema, it is possible to build this type of schema by splitting the one-star schema into more Star schemes. | TRUE |
| What is the full form of the acronym OLAP? | ONLINE ANALYTICAL PROCESSING |