Exam 1 Part 1
Intro to Data Science
Question | Answer |
---|---|
What are the roles of a data scientist? | engineer/developer, data engineer, database administrator, big data specialist, researcher, analyst, business manager or executive, full-stack data scientist (unicorn) |
What are the steps of a collaborative project? | planning, data preparation, modeling, follow-up |
What is included in the 1st step of the project? | define goals, organize resources, coordinate people, schedule the work |
What is included in the 2nd step of the project? | find, clean, explore, and refine data |
What is included in the 3rd step of the project? | create, validate, and evaluate the model |
What is included in the 4th step of the project? | present, deploy, and revisit the model. Archive assets. |
How would you compare the importance of technical vs. domain skills? | Technical skills cover programming, database and systems administration, cloud management, and big and distributed data, while domain skills cover the management side: project management, budgeting, business development, governance, and compliance. |
What is velocity in big data? | Data is generated very quickly, so you must decide how to process it and whether to keep or discard it. |
What is the overlap between big data and data science? | Volume, velocity, and variety |
Why are Python and R the most-used languages for data science? | Both are open source and easy to use, with simple syntax, a gentle learning curve, and a variety of pre-built packages and libraries tailored to data science analysis and visualization |
What is the difference between a data scientist and a statistician? | Data science is a multidisciplinary field that joins statistics with programming in an applied setting |