Save
Busy. Please wait.
Log in with Clever
or

show password
Forgot Password?

Don't have an account?  Sign up 
Sign up using Clever
or

Username is available taken
show password


Make sure to remember your password. If you forget it there is no way for StudyStack to send you a reset link. You would need to create a new account.
Your email address is only used to allow you to reset your password. See our Privacy Policy and Terms of Service.


Already a StudyStack user? Log In

Reset Password
Enter the associated with your account, and we'll email you a link to reset your password.
focusNode
Didn't know it?
click below
 
Knew it?
click below
Don't Know
Remaining cards (0)
Know
0:00
Embed Code - If you would like this activity on your web page, copy the script below and paste it into your web page.

  Normal Size     Small Size show me how

Compsci #10

From 12 [big data]

TermDefinition
Big data Ex volume of data at Google, NASA, etc
The Vs of Big Data Main ones - Volume, Variety, Velocity. Other - Veracity, Variability, Value
Big data challenges Difficult to effectively and efficiently capture, store, and analyze big data. Also new breeds of tech are needed
Big data considerations Platform limitations, data arriving too fast for the platform to handle, etc
Success factors for big data analytics A clear business need, strong sponsorship, alignment between the business and IT strategy
In memory analytics Storing and processing the complete data set in RAM
In database analytics Placing analytic procedures close to where data is stored
Grid computing & MPP Use of many machines and processors in parallel (MPP - massively parallel processing)
Appliances Combining hardware, software, and storage in a single unit for performance and scalability
Data integration The ability to combine data quickly and at reasonable costs
Processing capabilities The ability to process the data quicky, as it is captured (ex stream analytics)
MapReduce MapReduce processes large, complex data files by distributing tasks across many simple computers to achieve high performance
Hadoop Open-source framework for storing and analyzing massive unstructured data using low-cost hardware for easy scaling
How Hadoop works Uses HDFS(file system) to split data into parts across nodes on commodity hardware, with one node as Facilitator and another as Job Tracker
Hadoop and data warehousing Hadoop handles unstructured, large-scale data, while data warehouses work with structured, processed data. Together, Hadoop can store raw data, and the warehouse can analyze it after processing
MapReduce vs Hadoop Related but not the same. MapReduce provides control for analytics, but it is not an analytic. Hadoop is about data diversity, not just data volume
Created by: jolly_n4
Popular Computers sets

 

 



Voices

Use these flashcards to help memorize information. Look at the large card and try to recall what is on the other side. Then click the card to flip it. If you knew the answer, click the green Know box. Otherwise, click the red Don't know box.

When you've placed seven or more cards in the Don't know box, click "retry" to try those cards again.

If you've accidentally put the card in the wrong box, just click on the card to take it out of the box.

You can also use your keyboard to move the cards as follows:

If you are logged in to your account, this website will remember which cards you know and don't know so that they are in the same box the next time you log in.

When you need a break, try one of the other activities listed below the flashcards like Matching, Snowman, or Hungry Bug. Although it may feel like you're playing a game, your brain is still making more connections with the information to help you out.

To see how well you know the information, try the Quiz or Test activity.

Pass complete!
"Know" box contains:
Time elapsed:
Retries:
restart all cards