Search for notes by fellow students, in your own course and all over the country.

Browse our notes for titles which look like what you need, you can preview any of the notes via a sample of the contents. After you're happy these are the notes you're after simply pop them into your shopping cart.

My Basket

You have nothing in your shopping cart yet.

Title: Big Data Analyst
Description: Big Data Analyst Roles and Responsibilities. At its core, Big Data analytics centers on uncovering and analyzing useful data like hidden trends and patterns so companies can make more informed business decisions and gain a competitive advantage.

Document Preview

Extracts from the notes are below, to see the PDF you'll receive please use the links above


“Compression”
...
The _________ codec from Google provides modest compression ratios
...

2
...

a) Snappy is licensed under the GNU Public License (GPL)
b) BgCIK needs to create an index when it compresses a file
c) The Snappy codec is integrated into Hadoop Common, a set of common utilities that
supports other Hadoop subprojects
d) None of the mentioned
Answer: c
Explanation: You can use Snappy as an add-on for more recent versions of Hadoop that
do not yet provide Snappy codec support
...
Which of the following compression is similar to Snappy compression?
a) LZO
b) Bzip2
c) Gzip
d) All of the mentioned
Answer: a
Explanation: LZO is only really desirable if you need to compress text files
...
Which of the following supports splittable compression?
a) LZO
b) Bzip2
c) Gzip
d) All of the mentioned
Answer: a
Explanation: LZO enables the parallel processing of compressed text file splits by your
MapReduce jobs
...
Point out the wrong statement
...

6
...

7
...

a)
...
gz
c)
...
g
Answer: b
Explanation: You can use the gunzip command to decompress files that were created by
a number of compression utilities, including Gzip
...
Which of the following is based on the DEFLATE algorithm?
a) LZO
b) Bzip2
c) Gzip
d) All of the mentioned
Answer: c
Explanation: gzip is based on the DEFLATE algorithm, which is a combination of LZ77
and Huffman Coding
...
__________ typically compresses files to within 10% to 15% of the best available
techniques
...

10
...

a) 128k
b) 256k
c) 24k
d) 36k
Answer: b
Explanation: LZO was designed with speed in mind: it decompresses about twice as fast
as gzip, meaning it’s fast enough to keep up with hard drive read speeds


Title: Big Data Analyst
Description: Big Data Analyst Roles and Responsibilities. At its core, Big Data analytics centers on uncovering and analyzing useful data like hidden trends and patterns so companies can make more informed business decisions and gain a competitive advantage.