Term
A ___ is a pool of data produced to support decision making enterprise wide. |
|
Definition
Data Warehouse |
|
Term
The four major characteristics of data warehousing are: |
|
Definition
- Subject Oriented
- Integrated
- Time-Variant
- Non-Volatile
|
|
|
Term
___ - Characteristic of a DW that says data are organized by topics, such as sales, products, customers, etc. Best for providing a more comprehensive view of the organization; not only how a business is operating, but why. |
|
Definition
Subject Oriented |
|
Term
___ - Characteristic of a DW that says data from different sources are stored in a consistent format. Clarity is obtained in units of measures, naming/labeling of attributes, etc. |
|
Definition
Integrated |
|
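As a rough illustration of the integration characteristic, the sketch below maps two sources onto one naming scheme and one unit of measure. The source records, field names, and the choice of kilograms are all hypothetical; they only stand in for whatever the real feeds contain.

```python
# Hypothetical source records: names, fields, and units are illustrative only.
STORE_A = [{"cust_id": 1, "weight_lb": 2.0}]
STORE_B = [{"CustomerNo": 2, "weight_kg": 1.5}]

LB_TO_KG = 0.45359237  # kilograms per international pound


def integrate(store_a, store_b):
    """Map both sources onto one attribute naming scheme and one unit (kg)."""
    unified = []
    for row in store_a:
        unified.append({
            "customer_id": row["cust_id"],
            "weight_kg": round(row["weight_lb"] * LB_TO_KG, 3),
        })
    for row in store_b:
        unified.append({
            "customer_id": row["CustomerNo"],
            "weight_kg": round(row["weight_kg"], 3),
        })
    return unified


rows = integrate(STORE_A, STORE_B)
```

After this step every row carries the same attribute names and the same unit, regardless of which source it came from.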
Term
___ - Characteristic of a DW that says it provides data at various points in time (daily, weekly, monthly, quarterly, annually, historic, etc.). Every data warehouse should support time as a dimension. |
|
Definition
Time-Variant |
|
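One way to picture the time-variant characteristic is a set of date-stamped facts that can be rolled up to different periods. The sketch below is a minimal example under that assumption; the sales figures and the `total_by` helper are made up for illustration.

```python
from collections import defaultdict
from datetime import date

# Hypothetical sales facts, each stamped with the date it describes.
SALES = [
    (date(2023, 1, 15), 100.0),
    (date(2023, 1, 20), 50.0),
    (date(2023, 4, 2), 75.0),
]


def total_by(period_key, facts):
    """Sum amounts per period; period_key turns a date into a period label."""
    totals = defaultdict(float)
    for day, amount in facts:
        totals[period_key(day)] += amount
    return dict(totals)


# The same facts viewed at two different time grains.
monthly = total_by(lambda d: (d.year, d.month), SALES)
quarterly = total_by(lambda d: (d.year, (d.month - 1) // 3 + 1), SALES)
```

Because each fact keeps its date, the same data can answer monthly, quarterly, or annual questions without being reloaded.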
Term
___ - Characteristic of a DW that says users cannot change the data once entered into the data warehouse. This ensures that the DW is almost exclusively available for access. Obsolete data can be deleted and changes are recorded as new data. |
|
Definition
Non-Volatile |
|
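The non-volatile rule (no in-place edits, changes recorded as new data, obsolete data may be deleted) can be sketched as an append-only table. This is only an illustration: the `AppendOnlyTable` class and its method names are hypothetical, not part of any warehouse product.

```python
from datetime import date


class AppendOnlyTable:
    """Rows can be added or purged as obsolete, but never edited in place."""

    def __init__(self):
        self._rows = []  # each row is (key, value, as_of_date)

    def record(self, key, value, as_of):
        # A change is stored as a new row; older rows are left untouched.
        self._rows.append((key, value, as_of))

    def history(self, key):
        return [(value, as_of) for k, value, as_of in self._rows if k == key]

    def current(self, key):
        versions = self.history(key)
        if not versions:
            return None
        return max(versions, key=lambda pair: pair[1])[0]

    def purge_before(self, cutoff):
        # Obsolete data may be deleted outright.
        self._rows = [row for row in self._rows if row[2] >= cutoff]


prices = AppendOnlyTable()
prices.record("widget", 9.99, date(2023, 1, 1))
prices.record("widget", 10.49, date(2023, 6, 1))  # a change, kept as new data
```

Here `prices.current("widget")` reflects the latest recorded value, while `prices.history("widget")` still shows both versions until a purge removes the older one.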
Term
Additional characteristics of a data warehouse may include:
- Designed for ___ usage/applications
- Has __/__ structure
- Uses __-__ architecture to provide easy access to end users
- For newer DWs, allows for ___ access to the data
- Contains __ (information that describes what the data are)
|
|
Definition
web-based
relational/multidimensional
client-server
real-time
metadata |
|
|
Term
There are THREE main types of data warehouses |
|
Definition
1. Data Mart
2. Operational Data Stores (ODS)
3. Enterprise Data Warehouses (EDW) |
|
|
Term
___ - a subset of a data warehouse, usually consisting of a single subject area (marketing, sales, customer satisfaction, inventory, production, etc.) |
|
Definition
Data Mart |
|
Term
___ data mart - created directly from the DW. This ensures that the user is viewing/using the same data available to all other users. Its data source is an EDW. |
|
Definition
Dependent |
|
Term
__ data mart - a small warehouse designed for a department or strategic business unit whose source is NOT an EDW. |
|
Definition
Independent |
|
Term
_____ - A type of database often used as an interim (or staging) area for a data warehouse, especially for customer information files. Data are updated frequently through the course of business operations as opposed to the static contents of a data warehouse. |
|
Definition
Operational Data Stores (ODS) |
|
|
Term
__ - a large-scale data warehouse that is used across the enterprise/company for decision support. Because of its scale, it integrates data from many sources into a standard format. (DirecTV and Enterprise Rental use this type of data warehouse.) (It provides data for many types of Decision Support Systems, including CRM, SCM, BPM, BAM, PLM, revenue management, and KMS.) |
|
Definition
Enterprise Data Warehouse (EDW) |
|
|
Term
Problem: NCR faced the challenge of growing into an integrated, solution-oriented business structure with a global focus.
Solution? |
|
Definition
|
|
Term
___ - Data about data. Describes the contents of a data warehouse, its structure (field names, data type, default value, length), meaning, syntax, and the manner of its use. |
|
Definition
Metadata |
|
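A small sketch of what such "data about data" might look like in practice is shown below. The structural attributes (field name, data type, default value, length) come from the card above; the `FieldMetadata` class, the `describe` helper, and the sample fields are assumptions made up for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FieldMetadata:
    name: str        # field name
    data_type: str   # data type
    length: int      # length
    default: object  # default value
    meaning: str     # what the field means to the business


# Hypothetical description of two fields in a customer table.
CUSTOMER_METADATA = [
    FieldMetadata("customer_id", "INTEGER", 10, None, "Unique customer key"),
    FieldMetadata("region", "CHAR", 2, "NA", "Two-letter sales region code"),
]


def describe(fields):
    """Render each field's metadata as one readable line."""
    return [
        f"{f.name}: {f.data_type}({f.length}), default={f.default!r}, {f.meaning}"
        for f in fields
    ]


lines = describe(CUSTOMER_METADATA)
```

Note that none of these entries is a customer record; they only describe how customer records are structured and what they mean.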
Term
The 5 major components of the Data Warehouse Process are: |
|
Definition
- Data Sources
- ETL (Extraction, Transformation, Load) Process
- Comprehensive Database
- Metadata
- Middleware Tools
|
|
|
Term
__ - transactional data (OLTP) such as CRM and ERP data, web logs from the internet, external data (ex: census data), Access and SQL data (and other formats), legacy systems (older, outdated computer systems), etc. |
|
Definition
Data Sources |
|
Term
__ - Data are extracted from external data sources using custom ETL software, held in a staging area where they are transformed, cleansed, and integrated, then loaded into the Data Warehouse and/or data marts. (ex: MS SQL Server SSIS) |
|
Definition
ETL (Extraction, Transformation, Load) Process |
|
|
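The extract, stage, transform, and load flow described in this card can be sketched in a few lines. This is a toy under stated assumptions: Python's built-in `sqlite3` in-memory database stands in for the warehouse (it is not SSIS), and the source rows, the `orders` table, and the cleansing rules are hypothetical.

```python
import sqlite3

# Hypothetical source rows: (order id, customer name, amount as text).
SOURCE_ROWS = [(1, " alice ", "10.50"), (2, "BOB", "n/a"), (3, "Carol", "7")]


def extract(rows):
    """Copy rows out of the source into a staging list."""
    return list(rows)


def transform(staged):
    """Cleanse staged rows: tidy names, parse amounts, drop unparseable rows."""
    clean = []
    for order_id, name, amount in staged:
        try:
            value = float(amount)
        except ValueError:
            continue  # amount could not be parsed, so the row is rejected
        clean.append((order_id, name.strip().title(), value))
    return clean


def load(conn, rows):
    """Insert the cleansed rows into the warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(conn, transform(extract(SOURCE_ROWS)))
loaded = conn.execute(
    "SELECT id, customer, amount FROM orders ORDER BY id"
).fetchall()
```

Only the rows that survive cleansing reach the `orders` table; the row with the unparseable amount stays behind in staging.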
Term
____ - the EDW used to support all decision analyses. |
|
Definition
Comprehensive Database |
|
Term
___ - maintained so that they can be used by IT personnel and users; includes software programs with rules for organizing data that can be indexed and searched. |
|
Definition
Metadata |
|
Term
__ - tools that access the contents of the data warehouse. These are the front-end applications that users employ to interact with the data, including data mining, queries, OLAP, predictive analyses, reporting and visualization tools (ex: MS SQL MS, MS Excel with PowerPivot, and others). |
|
Definition
Middleware Tools |
|
Term
___ - Computer hardware that provides a specific service used by other computers |
|
Definition
Server |
|
Term
___ - a type of server that maintains efficient execution of procedures (programs) |
|
Definition
Application Server |
|
Term
__ - a type of server, sometimes referred to as "the back end", that holds the database or data warehouse. |
|
Definition
Database Server |
|
Term
___ - allows a user to request a server's content or function. |
|
Definition
Client |
|
Term
List the 3 components of a 3-Tiered Architecture
Advantage: ___
Disadvantage: ___ |
|
Definition
Client Workstation
Application Server
Database Server
Advantage: separates application and database functions when higher capacity is needed and fast performance is important.
Disadvantage: More hardware = increased costs |
|
|
Term
List the 2 components of a 2-Tiered Architecture
Advantage: ___
Disadvantage: ___ |
|
Definition
Client Workstation
Application Server AND Database (on same server)
Advantage: more economical when the size is not large
Disadvantage: Performance issues when the usage demand is high |
|
|
Term
What are the 4 issues to consider when deciding which architecture to use? |
|
Definition
- Which DBMS should we use?
- Will parallel processing and/or partitioning be used?
- Will data migration tools be used to load the DW?
- What tools will be used to support data retrieval and analysis?
|
|
|
Term
Teradata Corporation supports the ____ architecture |
|
Definition
|
|
Term
Data integration requires 3 major processes: |
|
Definition
Data Access
Data Federation
Change Capture |
|
|
Term
List the 5 steps of the ETL Process ("SETIL") |
|
Definition
SELECT
EXTRACT
TRANSFORM
INTEGRATE
LOAD |
|
|
Term
To ensure a successful DW implementation process, the business must:
a. __
b. __
c. __
d. __ |
|
Definition
a. Define the plan (business objectives and strategies)
b. Gather support from managers and end users
c. Set reasonable time frames and budgets
d. Manage expectations |
|
|
Term
The Inmon Model uses the ___ approach.
The Kimball Model uses the ___ approach.
Both ultimately result in an EDW. |
|
Definition
EDW (Top Down)
Data Mart Approach (Bottom Up)
Enterprise Data Warehouse |
|
|