click below
click below
Normal Size Small Size show me how
Database SA3
| Term | Definition |
|---|---|
| He was considered as father of Data Warehouse. | INMON BILL |
| Client-side front end tools is in ______ tier. | TOP |
| Raising ownership, dimensional technique and extra reporting are some ______ of data warehouse. | DISADVANTAGES |
| An EDW architecture with data mart level is a three-tier architecture. | False |
| ODS support only daily operations, so their view of historical data is very limited. | True |
| The ‘T’ in the acronym OLTP, stands for: TRANSACTIONS. | True |
| The acronym OLAP stands for: Online Analysis Processing. | False |
| ODS support only ______ operations, so their view of historical data is very limited. | All the options |
| ODS uses _______ systems to manage dynamic data in real-time. | OLTP |
| An EDW architecture with OLAP | Three-Tier |
| An EDW component that refers to the tools that give end users access to data. | REPORTING LAYER |
| A _______ data mart allows sourcing organization’s data from a single data warehouse. | DEPENDENT |
| A schema where multiple fact tables share dimension tables. | GALAXY SCHEMA |
| A ________ is a logical description that describes the entire database. | SCHEMA |
| The _______ table is a collection of reference information about a measurable in the fact table. | DIMENSION |
| The disadvantage of the snowflake schema it uses large disk space. | False |
| The Galaxy schema can have one fact table and a number of associated dimension tables. | False (STAR SCHEMA) |
| The _____ tier mainly consists of the Data Sources, ETL Tool, and Data Warehouse. | BOTTOM |
| OLAP Server is in _______ tier. | MIDDLE |
| The ________ is also known as backend component. | QUERY MANAGER |
| Data Warehousing started in the late 1980s when Paul Murphy and ___________ developed the Business Data Warehouse. | Barry Devlin |
| An EDW with data mart level is a two-tier architecture. | True |
| The ETL tool transform data in the staging area before EDW. | True |
| The Data Mart is a type of data warehouse that is refreshed in real time. | False (Data warehouse) |
| The Designing step of Data Mart implementation involves creating the physical database and the logical structures. | False (Constructing) |
| A _______ data mart is created without the use of a central data warehouse. | INDEPENDENT |
| The ______tool transform data in the staging area before EDW. | ETL |
| The acronym ODS stands for : __________________. | OPERATIONAL DATA STORE |
| The ______ table contains the foreign key column that allows joins with dimension tables. | FACT |
| The Fact table is consist of the key column and measures or ______. | FACT DATA |
| Galaxy schema is also know as FACT CONSTELLATION SCHEMA. | True |
| A data warehouse architecture where metadata, summary data, and raw data are stored within the central repository of the warehouse. | SIMPLE |
| Data Warehousing started in the late 1980s when Paul _____ and Barry Devlin developed the Business Data Warehouse. | MURPHY |
| Type of data warehouse which is refreshed in real time. | ODS |
| A ______ data mart that can take data from data warehouses or operational systems. | HYBRID |
| An EDW with database directly connected with the analytical interfaces where the end user can make queries. | One-tier |
| The Star Schema is also know as _______. | STAR JOIN SCHEMA |
| The ______ Schema is also know as STAR JOIN SCHEMA). | STAR |
| The fact table contains the primary key column that allows joins with dimension tables. | FALSE |
| The fact table measures those that contain numeric facts. | True |
| Data warehouse architecture with data marts. | HUB AND SPOKE |
| The load manager is also known as _______ component. | FRONT |
| The acronym OLAP stands for: Online Analytical Processing. | True |
| The ‘T’ in the acronym OLTP, stands for: TRANSFORM. | False |
| An EDW with data mart level is a three-tier architecture. | False (Two-tier) |
| Data Warehouse uses ________ systems organize and present information in specific formats to accommodate the diverse needs of various users. | OLAP |
| In ______ schema the dimension tables are normalized which splits data into additional tables. | SNOWFLAKE |
| An extension of star schema where the dimension tables are connected to one or more dimensions. | SNOWFLAKE SCHEMA |
| These tables hold fields that represent the direct facts, as well as the foreign fields that are used to connect the fact table with other dimension tables in the Data Warehouse system. | FACT TABLE |
| The Star schema can have one fact table and a number of associated dimension tables. | True |
| Alternative name for Data Warehouse System is EIS. What is the full form of this acronym? | Executive Information System |
| Data warehouse in Banking, Telecommunication, Financial Services are examples of __________. | Applications of Data Warehouse |
| What is the full form of the acronym OLTP? | Online Transactions Processing Databases |
| The ‘T’ in the acronym OLTP, stands for: ___________. | TRANSACTIONS |
| Data Mart implementation step that involves putting the data to use: querying the data, creating reports, charts, and publishing them. | Accessing |
| The Star Schema is also know as JOINT SCHEMA. | False (STAR JOIN SHEMA) |
| The _______ is also called the front component. | LOAD MANAGER |
| The middle tier consists of the _______ servers. | OLAP |
| Data mining tool is an example of: | END-USER ACCESS TOOL |
| Enhanced Business Intelligence, query process and timely access to data are examples of ________. | Advantages of data warehouse |
| An electronic storage of a large amount of information by a business or organization. | DATA WAREHOUSE |
| A _______ is a specific type of database that represents data from multiple dimensions. | OLAP CUBE |
| In Star Schema, Every dimension in a star schema is represented with the only one-dimension table. | True |
| A type of Data Management System that is solely intended to perform queries and analysis from large amount of historical data to support BI activities. | WAREHOUSE |
| Data mart has a more implementation time compared to data warehouse systems. | False |
| An EDW with OLAP level is a two-tier architecture. | False |
| An EDW architecture with data mart level | Two-Tier |
| An EDW with _________ level is a two-tier architecture. | DATA MART |
| A data warehouse architecture where operational data must be cleaned and processed before being put in the warehouse. | SIMPLE WITH STAGING AREA |
| The OLTP is used to manage dynamic data in real-time. | TRUE |
| The primary key column of the fact table has uniquely identifies each dimension record or row. | FALSE |
| The QUERY MANAGER is also known as the _____ component. | BACKEND |
| A data warehouse component that performs operations associated with the management of the data in the warehouse. | WAREHOUSE MANAGER |
| The ODS is a type of data warehouse that is refreshed in real time. | TRUE |
| The fact table contains the foreign key column that allows joins with ________ tables. | DIMENSION |
| The ______ schema can have one fact table and a number of associated dimension tables. | STAR |
| The LOAD MANAGER is also called the ______ component. | FRONT |
| The Fact table measures is also known as numeric data. | TRUE |
| The tool that transform data in the staging area before EDW. | ETL |
| In Galaxy schema, it is possible to build this type of schema by splitting the one-star schema into more Star schemes. | TRUE |
| The Fact table measures is also known as fact data. Group of answer choices | TRUE |
| Galaxy schema is also know as ________. | FACT CONSTELLATION SCHEMA |
| Data warehouse component that transforms data in preparation for entering the data warehouse. | LOAD MANAGER |
| The middle tier consists of the _________. | OLAP SERVERS |
| Galaxy schema is also know as STAR CONSTELLATION SCHEMA. | FALSE |
| The fact table measures those that contain ________ facts. | NUMERIC |
| An OLAP operation that performs the analysis by taking one level of information for display. | SLICE |
| OLAP is located in between Front-end tools and _____________. | DATA WAREHOUSE |
| ETL is an acronym for : ______, _________,________. Note: Separate each word by a space. | EXTRACT TRANSFORM LOAD |
| The HOLAP system is a blend of _______ and MOLAP. Note: Write only the acronym. | ROLAP |
| What is the full form of the acronym OLAP? | ONLINE ANALYTICAL PROCESSING |
| An OLAP system that is created to facilitate management of both spatial and non-spatial data in a Geographic Information system. | SOLAP |
| An ETL loading type that applies ongoing changes as when needed periodically. | INCREMENTAL LOAD |
| A ______ is a data structure that allows fast analysis of data according to the multiple dimensions that define a business problem. | OLAP CUBE |
| An OLAP operation that performs analysis in deeper among the dimensions of data. | DRILL-DOWN |
| Artificial Intelligence refers to the algorithms that can learn from data to make predictions. | False (machine learning) |
| The descriptive analysis provides answers of the future queries that move across using historical data as the chief principle for decisions. | False (predictive analysis) |
| Classification technique is a supervised learning. | True |
| Decision Tree is an example of an association algorithm. | False |
| The EVALUATION phase of CRISP-DM that consists of presenting the results in a useful and understandable manner, and by achieving this, the project should achieve its goals. | False (deployment) |
| The ________ is about processing data and identifying patterns and trends in that information so that you can decide or judge. | DATA MINING |
| A DM algorithm that constructs a classifier in the form of a decision tree. | C4.5 |
| What is the 4th step in CRISP-DM? | MODELING |
| A DM algorithm that do not require a predefined set of outputs but rather look for patterns or trends without any label or target. | UNSUPERVISED LEARNING |
| The CRISP-DM phase that sets the initial data collection and proceeds with activities in order to get familiar with the data. | Data understanding |
| The full form of DOLAP acronym is ________ OLAP. | DESKTOP |
| The ________ type of ETL Loading process populates all the data warehouse tables. | INITIAL LOAD |
| An OLAP operation that is also known as consolidation, use to summarize operation data along with the dimension. Group of answer choices SLICE Not in the options DRILL-UP DICE | DRILL-UP |
| An ETL loading type that populates all the data warehouse tables. | INITIAL LOAD |
| Clustering is an unsupervised learning. | True |
| Clustering technique is a supervised learning. | False |
| Machine Learning refers to the algorithms that can learn from data to make predictions. | True |
| A data mining technique that is used to make predictions based on relationships within the data set. | REGRESSION |
| Data mining is applied in _________ websites to offer cross-sells and up-sells through their websites. | E-COMMERCE |
| KDD stands for: | Knowledge Discovery in Databases |
| The Naïve Bayes algorithm is a _________ data mining. | CLASSIFICATION |
| The type of OLAP system that works on the information that resides in a relational database. Note: Write only the acronym. | ROLAP |
| An ETL loading type that erase the contents of one or more tables and reloading with fresh data. | FULL REFRESH LOAD |
| This strategy is also known delta, where only the data being changed is extracted and update data warehouses. | Partial Extraction (with update notification) |
| The ______ operation perform the analysis that can gain a new view of data by rotating the data axes of the cube. | PIVOT |
| Classification technique is an unsupervised learning. | False |
| The predictive analysis provides answers of the future queries that move across using historical data as the chief principle for decisions. | True |
| The DEPLOYMENT phase of CRISP-DM that consists of presenting the results in a useful and understandable manner, and by achieving this, the project should achieve its goals. | True |
| The _______ analysis provides answers of the future queries that move across using historical data as the chief principle for decisions. | PREDICTIVE |
| It refers to the numeric study of data relationships. | STATISTICS |
| An OLAP system that utilizes multi-dimensional database for storing and analyzing information. Note: Write only the acronym. | MOLAP |
| Which type of ETL Loading process populates all the data warehouse tables? | INITIAL LOAD |
| Clustering technique is an unsupervised learning. | True |
| Clustering technique is a(n) _________ learning. | UNSUPERVISED |
| Decision Tree is an example of a classification algorithm. | True |
| A data mining technique that is used to determine when something is noticeably different from the regular pattern. | ANOMALY DETECTION |
| A type of data mining algorithm that is used to mine data and provide the latest information on past or recent events. | DESCRIPTIVE ANALYSIS |
| A data mining algorithm that requires a label or target. | SUPERVISED LEARNING |
| The phase of CRISP-DM that consists of presenting the results in a useful and understandable manner, and by achieving this, the project should achieve its goals. | DEPLOYMENT |
| The full form of DOLAP acronym is ________ OLAP. | DESKTOP |
| Classification technique is a(n) ________ learning. | SUPERVISED |
| A DM algorithm that has an assumption: Every feature of the data being classified is independent of all other features given the class. | NAIVE BAYES |
| Unsupervised learning is a predictive analysis. | False (Descriptive Analysis) |
| These are algorithms that can learn from data to make predictions. | MACHINE LEARNING |
| Decision Tree is an example of a(n) _________ algorithm. | CLASSIFICATION |
| In the 9-step KDD process, choosing a data mining task is the same as choosing data mining algorithm. | FALSE |
| The fact table contains the ________ key column that allows joins with dimension tables. | FOREIGN |