Term
Accumulo is based off BigTable (T/F) |
|
Definition
True. Accumulo is based off Google's BigTable technology. |
|
|
Term
Accumulo is a column-oriented database (T/F) |
|
Definition
True. Most SQL databases are row-oriented. |
|
|
Term
What are "locality groups"? |
|
Definition
A group of columns of data so analytical applications take advantage of related data. |
|
|
Term
Column Families need to be declared before being used (T/F) |
|
Definition
|
|
Term
What is a "column family"? |
|
Definition
related columns stored in consecutive key-value pairs |
|
|
Term
What is the goal of Accumulo? |
|
Definition
To provide a set of features that work well even as data sizes grow into the 10s of petabytes even in the presence of the regular failures expected of cheaper commodity-class hardware that is commonly used. |
|
|
Term
What is "Shared-Nothing Architecture"? |
|
Definition
Distributed system architecture using separate computers that do not share memory, CPUs, or hard drives. |
|
|
Term
What is another name for "Shared-Nothing Architecture"? |
|
Definition
|
|
Term
What is "Vertical Scaling"? |
|
Definition
Distributed computer systems where computers share CPU, memory, or hard drives. |
|
|
Term
Which architecture (Vertical or Horizontal) is more likely to use a Storage Area Network (SAN)? |
|
Definition
|
|
Term
Which architecture (Vertical or Horizontal) is more likely to use SATA versus SCSI drives? |
|
Definition
Horizontal. Horizontal architecture is not dependent on expensive hard drives. |
|
|
Term
Which architecture (Vertical or Horizontal) is more likely to use inexpensive hardware? |
|
Definition
|
|
Term
|
Definition
Hadoop Distributed Filesystem |
|
|
Term
|
Definition
In Accumulo, data is stored in tables and tables are partitioned into "tablets" |
|
|
Term
What are "tablet servers"? |
|
Definition
A server that hosts a number of tablets |
|
|
Term
Accumulo automatically finds out how to partition data (T/F) |
|
Definition
|
|
Term
If the incoming data causes a single tablet to become larger than others, what happens? |
|
Definition
Upon a configurable threshold, Accumulo splits the tablet into two. |
|
|
Term
Accumulo allows tablet servers to be responsible for tablets that are stored on another server (T/F) |
|
Definition
True, at least, temporarily. Over time, tablet servers will create local copies of the data in background operations to avoid reads over the network in response to client requests. |
|
|
Term
How is data stored in Accumulo? |
|
Definition
|
|
Term
How does Accumulo maintain performance? |
|
Definition
The keys are sorted at all times. |
|
|
Term
What does "hashing the key" mean? |
|
Definition
When the key is passed as the input to the hash function |
|
|
Term
Accumulo uses hashing algorithms to distribute data (T/F). |
|
Definition
False. Accumulo does not rely on hashing for data distribution, but rather, uses sorting. |
|
|
Term
What does the concept of "locality" mean? |
|
Definition
When data is sorted, it is easy to find related data because the data is sorted instead of hashed. |
|
|
Term
What is the "metadata table"? |
|
Definition
The "metadata table" is a mapping of which machine has which portion of the sorted set of key-value pairs. |
|
|