(Advanced Data Warehousing) Chapter 5 - Lesson Summary - Data Warehouse Optimization

About this set

Created by:

jontbrooks  on January 19, 2011

Subjects:

advanced data warehousing

Classes:

Maclean_Dec-Jan

Log in to favorite or report as inappropriate.
Pop out
No Messages

You must log in to discuss this set.

(Advanced Data Warehousing) Chapter 5 - Lesson Summary - Data Warehouse Optimization

Aggregation refers to...
summarizing fact data
1/31

Study:

Cards (new!)

Learn

Test

Speller

Scatter

Games:

Scatter

Space Race

Tools:

Export

Copy

Combine

Embed

Order by

Terms

Definitions

Aggregation refers to... summarizing fact data
You can aggregate data on the fly when you run a _____, or you can aggregate data ahead of time and store the result, which is known as _____ query, pre-aggregation
A base fact table stores... fact data at the lowest levels at which a source system records transactions
An aggregate fact table is a fact table where the data is... pre-aggregated and stored at a higher level for one or more hierarchies
An aggregate fact table is also know as... a summary table
You create aggregate fact tables to... increase query performance
When devising the aggregation strategy for your data warehouse, you should consider the following factors: • query profile
• attribute relationship volatility
• compression rates
You should only create aggregate fact tables that users will... frequently query when running reports
Attributes that have relationships that change frequently are considered... volatile attributes
True or False

Creating aggregate fact tables that include volatile attributes can significantly decrease maintenance overhead?
False (it significantly increases it)
For some database platforms, you can use _____ _____ as an alternative to creating aggregate fact tables materialized views
Materialized views are especially useful if you need to create aggregate fact tables that include... volatile attributes
The compression ratio is the... average number of child records you combine to create a single parent record when aggregating data
You can calculate the compression ratio for two attributes based on the number of... elements that exist for each attribute
Pre-aggregating data is only cost effective if... the compression ratio is significant
Partitioning is the division of... a larger table into smaller tables
There are two basic types of partitioning: • server level
• application level
Server-level partitioning involves... dividing one physical table into logical partitions in the database environment
Application-level partitioning involves... dividing one large table into several separate, smaller physical tables called partition base tables
MicroStrategy supports application-level partitioning for fact tables through... warehouse partitioning mapping or metadata partitioning mapping
Application-level partitioning can provide the following advantages over server-level partitioning: • Ability to partition a single fact table by multiple hierarchies or dimensions
• Logic to determine which partitions must be accessed for a given query without requiring a filter on a specific attribute element
• Less time involved to scan physical partitions rather than logical partitions
When devising the partitioning strategy for your data warehouse, you should consider the following factors: • attribute relationship volatility
• distribution of data across partitions
• partition table size and the number of partition tables
• impact of the partitioning strategy on the ETL and batch process
Partitioning tables based on volatile attributes can significantly increase... maintenance overhead
You should choose a partitioning strategy that minimizes the size of... partition tables while also reducing the number of tables you have to access to resolve queries
Your partitioning strategy should not unnecessarily lengthen or complicate the... ETL or batch process that is used to load data in the warehouse
Indexes are database objects that enable quick access to the _____ in a table based on _____ values. Indexing increases _____ by ordering rows in a table so that the _____ can more easily find the particular _____ that are needed to resolve a given query data, key, performance, database, rows
The simplest kind of index is a _____ _____ index. It is created _____ when you define the primary key for a _____. For tables that contain a large amount of data or are highly _____, you often need to create additional _____ besides the primary key index. primary key, automatically, table, denormalized, indexes
A _____ index has a structure like a family tree. The index begins with a _____, and each row in the table is compared to this _____ value and placed within the B-tree structure. This type of index works best for higher _____ attributes B-tree, root, root, cardinality
A _____ index orders the rows in a table using _____ _____ that are generated and assigned by the _____. This type of index works best for the _____ tables of low _____ attributes. bitmap, binary strings, database, lookup, cardinality
An _____-_____ table stores data in the physical table in index order. This type of index is very useful for _____ tables or _____ tables of high _____ attributes index-organized, fact, lookup, cardinality
When you build indexes on tables, you should consider the following factors: • columns on which you join tables
• degree of denormalization for tables
• attribute elements on which you frequently filter reports
• number of indexes on tables
• disk storage configuration

First Time Here?

Welcome to Quizlet, a fun, free place to study. Try these flashcards, find others to study, or make your own.

Set Champions

There are no high scores or champions for this set yet. You can sign up or log in to be the first!