Upgrade to remove ads
Chapter 6 Foundations of Business Intelligence: Databases and Information Management
Terms in this set (43)
using both relational and non-relational technology that are optimized for analyzing large datasets.
Each characteristic or quality describing a particular entity is called an
to describe these datasets with volumes so huge that they are beyond the ability of typical DBMS to capture, store, and analyze.
represents the smallest unit of data a computer can handle.
A group of bits, called a
is responsible for the specific policies and procedures through which data can be managed as an organizational resource.
also known as data scrubbing, consists of activities for detecting and correcting data in a database that are incorrect, incomplete, improperly formatted, or redundant.
capability to specify the structure of the content of the database. It would be used to create database tables and to define the characteristics of the fields in each table.
is an automated or manual file that stores definitions of data elements and their characteristics.
deals with the policies and processes for managing the availability, usability, integrity, and security of the data employed in an enterprise, with special emphasis on promoting privacy, security, data quality, and compliance with government regulations.
Data redundancy wastes storage resources and also leads to, where the same attribute may have different values.
Data manipulation language
DBMS includes tools for accessing and manipulating information in databases. Most DBMS have a specialized language called a _________ that is used to add, change, delete, and retrieve the data in the database.
is a subset of a data warehouse in which a summarized or highly focused portion of the organization's data is placed in a separate database for a specific population of users.
provides insights into corporate data that cannot be obtained with OLAP by finding hidden patterns and relationships in large databases and inferring rules from them to predict future behavior.
Data quality audit
is a structured survey of the accuracy and level of completeness of the data in an information system.
is the presence of duplicate data in multiple data files so that the same data are stored in more than one place or location.
is a database that stores current and historical data of potential interest to decision makers throughout the company.
is a collection of data organized to serve many applications efficiently by centralizing the data and controlling redundant data.
In close cooperation with users, the design group establishes the physical database, the logical relations among elements, and the access rules and security procedures. The functions it performs are called
Database management system (DBMS)
is software that permits an organization to centralize data, manage them efficiently, and provide access to the stored data by application programs.
DBMS resides on a dedicated computer called a ____________. The DBMS receives the SQL requests and provides the required data.
is a person, place, thing, or event on which we store and maintain information.
the relationship between the entities SUPPLIER, PART, LINE_ITEM, and ORDER
A grouping of characters into a word, a group of words, or a complete number (such as a person's name or age) is called a
a group of records of the same type is called a
When the field Supplier_Number appears in the PART table, it is called a
is an open source software framework managed by the Apache Software Foundation that enables distributed parallel processing of huge amounts of data across inexpensive computers.
relies primarily on a computer's main memory (RAM) for data storage.
specifies the organization's rules for sharing, disseminating, acquiring, standardizing, classifying, and inventorying information.
The field for Supplier_Number in the SUPPLIER table uniquely identifies each record so that the record can be retrieved, updated, or sorted.
Non-relational database management systems
use a more flexible data model and are designed for managing large data sets across many distributed machines and for easily scaling up or down.
The process of creating small, stable, yet flexible and adaptive data structures from complex groups of data is called
Online analytical processing (OLAP)
supports multidimensional data analysis, enabling users to view the same data in different ways using multiple dimensions.
is the unique identifier for all the information in any row of the table and this primary key cannot be duplicated.
refers to the coupling of data stored in files and the specific programs required to update and maintain those files such that changes in programs require changes to the data.
A group of related fields, such as the student's name, the course taken, the date, and the grade, comprises a
Relational database systems try to enforce __________ rules to ensure that relationships between coupled tables remain consistent.
The most popular type of DBMS today for PCs as well as for larger computers and mainframes is the
software is able to mine text comments in an e-mail message, blog, social media conversation, or survey form to detect favorable and unfavorable opinions about specific subjects.
Structured Query Language (SQL)
The most prominent data manipulation language today is
tools are now available to help businesses analyze these data.
Rows are commonly referred to as records, or in very technical terms, as
The discovery and analysis of useful patterns and information from the World Wide Web is called
THIS SET IS OFTEN IN FOLDERS WITH...
Chapter 1 Information Systems in Global Business T…
Chapter 3 Information Systems, Organizations, and…
Chapter 4 Ethical and Social Issues in Information…
Chapter 7 Telecommunications, the Internet, and Wi…
YOU MIGHT ALSO LIKE...
MIS: Ch. 6
MIS Chapter 6
Chapter 6 MIS
Chapter 6 - Foundations of Business Intelligence:…
OTHER SETS BY THIS CREATOR
Chapter 8 Securing Information Systems
Exam 1 - Chapters 1 thru 6 BUAD 683 Enter online b…
Chapter 1 Review: To help prepare for the common A…