Big Data Anlaytics: Unit 1 & 2 - Question Bank MCQ's

Download as pdf or txt
Download as pdf or txt
You are on page 1of 4

BIG DATA ANLAYTICS

Unit 1 & 2 – Question Bank


MCQ’s
Unit 1&2 – Choose the Best

1. Concerning the characteristics of big data, which of these is odd?


a) Variety b) Vastness. C) Velocity d)Volume
2. The feature of big data that refers to the type and nature of the data is...
a) Volume. b) Veracity c) Variety d) Variability
3. Concerning factory work and cyber-physical system, which of these is odd?
a) Cloud. B) Community. C) Customization. D) Control
4. Big data challenges include the following except...
a) Data analysis b) Data visualization. C) Data interpretation. d) Data storage
5. Multidimensional big data can be represented as...
a) Tensors. B) Data applications c) Databases. D) Cloud applications
6. Big data technologies include the following except...
a) Business intelligence. B) Cloud computing c) Databases. D) Machine learning
7. According to the 2011 McKinsey Global Institute report, big data includes...
a) A/B Testing. B) Data Learning. C) Machine Learning d) Business Intelligence
8. Just collecting and storing information isn't enough to produce real business value. data
analytics technologies are necessary to:
a) Formulate eye-catching charts and graphs
b) Extract valuable insights from the data
c) Integrate data from internal and external sources
9. What are the V’s of Big Data?
a) Volume b) Velocity c) Variety d) All the above
10. What are the main components of Big Data?
a) MapReduce b) HDFS c) YARN d) All of these
11. What are the different features of Big Data Analytics?
a) Open-Source b) Scalability c) Data Recovery d) All the above
12. Facebook Tackles Big Data With _______ based on Hadoop
a) Project Prism b) Prism c) ProjectData d) ProjectBid
13. What is a unit of data that flows through a Flume agent?
a) Record b) Event c) Row d)Log
14. Just collecting and storing information isn't enough to produce real business value. Big
data analytics technologies are necessary to:
a) Formulate eye-catching charts and graphs
b) Extract valuable insights from the data
c) Integrate data from internal and external sources
15. The method by which companies analyze customer data or other types of information in an
effort to identify patterns and discover relationships between different data elements is
often referred to as:
a) Data mining b) Data digging c) Customer data management
16. What is the recommended best practice for managing big data analytics programs?
a) Adopting data analysis tools based on a laundry list of their capabilities
b) Letting go entirely of "old ideas" related to data management

Unit 1 & 2 - QB – BDA (IV CSE & CIVIL) – JBIET – Dr.G. Arun Sampaul Thomas 1
c) Focusing on business goals and how to use big data analytics technologies to meet
them
17. Companies that have large amounts of information stored in different systems should begin
a big data analytics project by considering:
a) The creation of a plan for choosing and implementing big data infrastructure
technologies
b) The interrelatedness of data and the amount of development work that will be
needed to link various data sources
c) The ability of business intelligence and analytics vendors to help them answer business
questions in big data environments
18. What is the name of the programming framework originally developed by Google that
supports the development of applications for processing large data sets in a distributed
computing environment?
a) MapReduce b) Hive c) Zookeeper
19. The method by which companies analyze customer data or other types of information in an
effort to identify patterns and discover relationships between different data elements is
often referred to as:
a) Data mining b) Data digging c) Customer data management
20. What is the recommended best practice for managing big data analytics programs?
a) Adopting data analysis tools based on a laundry list of their capabilities
b) Letting go entirely of “old ideas” related to data management
c) Focusing on business goals and how to use big data analytics technologies to
meet them
21. Companies that have large amounts of information stored in different systems should begin
a big data analytics project by considering:
a) The creation of a plan for choosing and implementing big data infrastructure
technologies
b) The interrelatedness of data and the amount of development work that will be
needed to link various data sources
c) The ability of business intelligence and analytics vendors to help them answer
business questions in big data environments
22. What is the name of the programming framework originally developed by Google that
supports the development of applications for processing large data sets in a distributed
computing environment?
a) MapReduce b) Hive. C) Zookeeper
23. What is the default ordering of data pairs in HBase?
a) Big endian. B) Lexicographical. C) Little endian. D) Sequential
24. Filters in HBase can be applied to
a) Row keys. B) Column qualifiers. C) Data values
b) d) All of the above
25. According to analysts, for what can traditional IT systems provide a foundation when
they’re integrated with big data technologies.
a) Big data management and data mining
b) Data warehousing and business intelligence
c) Management of Hadoop clusters
d) Collecting and storing unstructured data
26. As companies move past the experimental phase with Hadoop, many cite the need for
additional capabilities, including:
a) Improved data storage and information retrieval
b) Improved extract, transform and load features for data integration

Unit 1 & 2 - QB – BDA (IV CSE & CIVIL) – JBIET – Dr.G. Arun Sampaul Thomas 2
c) Improved data warehousing functionality
d) Improved security, workload management and SQL support
27. What license is Hadoop distributed under?
a) Apache License 2.0. b) Mozilla Public License. c) Shareware d) Commercial
28. According to analysts, for what can traditional IT systems provide a foundation when
they’re integrated with big data technologies like Hadoop?
a) Big data management and data mining b) Data warehousing and business intelligence
c) Management of Hadoop clusters d) Collecting and storing unstructured data
29. All of the following accurately describe Hadoop, EXCEPT:
a) Open source b) Real-time c) Java-based. d) Distributed computing approach
30. __________ has the world’s largest Hadoop cluster.
a) Apple b) Datamatics c) Facebook. d) None of the mentioned

Unit 1&2 – True/False

1. Recommendation engines provide random recommendations. - False


2. Data scientists are continuously growing in demand. - True
3. Businesses can utilize information from big data to maintain a competitive advantage. -
True
4. The small size of big data poses a challenge for businesses. - False
5. Data analytics helps accountants to analyze data and detect fraud. - True
6. Auditors can reference data through photos and GPS location to verify a transaction. - True
7. IBM SPSS is one of the Data Analytics tool - True
8. The feature of big data that refers to the type and nature of the data is Veracity – False
9. Concerning factory work and cyber-physical system, Community is important - True
10. Data visualization is one of the Big data challenges. – True
11. Multidimensional big data can be represented as Cloud applications – False
12. Big data technologies include Machine learning – False
13. According to the 2011 McKinsey Global Institute report, big data includes Data learning –
True
14. Data Mining is the method by which companies analyze customer data or other types of
information in an effort to identify patterns and discover relationships between different
data elements. – True
15. Zookeeper is the name of the programming framework originally developed by Google that
supports the development of applications for processing large data sets in a distributed
computing environment. – False
16. Lexicographical is the default ordering of data pairs in HBase. – True
17. Apache License 2.0 license is Hadoop distributed under. – True
18. Facebook has the world’s largest Hadoop cluster. – True.
19. Visibility is one of the V’s of Big Data? – False
20. MapReduce is the only main component of Big Data – False
21. To maximize the benefits of big data analytics techniques, it's critical for organizations to
select the right tools and involve people who bring needed analytical skills to a project.
True.
22. A big data analytics strategy is often defined by the three V's -- volume, variety and
velocity -- which is helpful but ignores other commonly cited characteristics, such as
complexity and variability.- True.

Unit 1 & 2 - QB – BDA (IV CSE & CIVIL) – JBIET – Dr.G. Arun Sampaul Thomas 3
23. For organizations that aren't currently looking to do big data analytics, there is little or no
benefit to examining the data they're retaining and evaluating how it's being used. False
24. Big data is the same as ordinary data. – False
25. Big data is explained with 3Vs: Volume, Velocity, and Variety. - True
26. Big data Analytics deals with unstructured data representation. – True
27. Uploading a picture on Facebook generates big data. – True
28. Traditional database management tools can handle the size and complexity of big data. –
False
29. Netflix utilizes predictive analytics. – True
30. Data Science is the same as Data Analytics. - False

5 Marks (Essay Type)

Unit 1:
1. While implementing marketing strategy for a new product in your company,
Identify and list some limitations of structured data related to this work.
Hint:
• Explain about structured DBMS systems for Employee Payroll etc.,
• What tools are used (Example: Oracle)
• Disadvantages of Structured data, why going for Unstructured data.
2. Compare the Parallel computing Vs Distributed computing for big data.
3. Write about BASE concepts to provide data consistency.
4. As a HR Manager of a Company providing Big Data solutions to clients, what
characteristics would you look for while recruiting a potential candidate for the
position of a Data analyst.
Hint:
• List out the roles of the Data Analyst
• Skill set required for the Analyst (Like Programming, Analytic skills)
• Mention some Tools that are used by Data Analyst, how it is effect in the Industry

Unit 2:
1. List out the skills required for an Analyst
2. Discuss about the various points to be considered during Analysis process.
3. Express the roles of the IT and analytics team in Big data analytics project.
4. Explain about the Text Mining process?

Unit 1 & 2 - QB – BDA (IV CSE & CIVIL) – JBIET – Dr.G. Arun Sampaul Thomas 4

You might also like