Big Data Analytics: Unit 1 & 2 - Question Bank MCQs
Unit 1 & 2 - QB – BDA (IV CSE & CIVIL) – JBIET – Dr.G. Arun Sampaul Thomas 1
c) Focusing on business goals and how to use big data analytics technologies to meet
them
17. Companies that have large amounts of information stored in different systems should begin
a big data analytics project by considering:
a) The creation of a plan for choosing and implementing big data infrastructure
technologies
b) The interrelatedness of data and the amount of development work that will be
needed to link various data sources
c) The ability of business intelligence and analytics vendors to help them answer business
questions in big data environments
18. What is the name of the programming framework originally developed by Google that
supports the development of applications for processing large data sets in a distributed
computing environment?
a) MapReduce b) Hive c) Zookeeper
19. The method by which companies analyze customer data or other types of information in an
effort to identify patterns and discover relationships between different data elements is
often referred to as:
a) Data mining b) Data digging c) Customer data management
20. What is the recommended best practice for managing big data analytics programs?
a) Adopting data analysis tools based on a laundry list of their capabilities
b) Letting go entirely of “old ideas” related to data management
c) Focusing on business goals and how to use big data analytics technologies to
meet them
21. Companies that have large amounts of information stored in different systems should begin
a big data analytics project by considering:
a) The creation of a plan for choosing and implementing big data infrastructure
technologies
b) The interrelatedness of data and the amount of development work that will be
needed to link various data sources
c) The ability of business intelligence and analytics vendors to help them answer
business questions in big data environments
22. What is the name of the programming framework originally developed by Google that
supports the development of applications for processing large data sets in a distributed
computing environment?
a) MapReduce b) Hive c) Zookeeper
23. What is the default ordering of data pairs in HBase?
a) Big endian b) Lexicographical c) Little endian d) Sequential
24. Filters in HBase can be applied to
a) Row keys b) Column qualifiers c) Data values
d) All of the above
25. According to analysts, for what can traditional IT systems provide a foundation when
they’re integrated with big data technologies?
a) Big data management and data mining
b) Data warehousing and business intelligence
c) Management of Hadoop clusters
d) Collecting and storing unstructured data
26. As companies move past the experimental phase with Hadoop, many cite the need for
additional capabilities, including:
a) Improved data storage and information retrieval
b) Improved extract, transform and load features for data integration
c) Improved data warehousing functionality
d) Improved security, workload management and SQL support
27. What license is Hadoop distributed under?
a) Apache License 2.0 b) Mozilla Public License c) Shareware d) Commercial
28. According to analysts, for what can traditional IT systems provide a foundation when
they’re integrated with big data technologies like Hadoop?
a) Big data management and data mining b) Data warehousing and business intelligence
c) Management of Hadoop clusters d) Collecting and storing unstructured data
29. All of the following accurately describe Hadoop, EXCEPT:
a) Open source b) Real-time c) Java-based d) Distributed computing approach
30. __________ has the world’s largest Hadoop cluster.
a) Apple b) Datamatics c) Facebook d) None of the mentioned
23. For organizations that aren't currently looking to do big data analytics, there is little or no
benefit to examining the data they're retaining and evaluating how it's being used. – False
24. Big data is the same as ordinary data. – False
25. Big data is explained with 3Vs: Volume, Velocity, and Variety. – True
26. Big data analytics deals with unstructured data representation. – True
27. Uploading a picture on Facebook generates big data. – True
28. Traditional database management tools can handle the size and complexity of big data. –
False
29. Netflix utilizes predictive analytics. – True
30. Data Science is the same as Data Analytics. – False
Unit 1:
1. While implementing a marketing strategy for a new product in your company,
identify and list some limitations of structured data related to this work.
Hint:
• Explain structured DBMS systems (for example, for employee payroll).
• Name the tools used (example: Oracle).
• State the disadvantages of structured data and why unstructured data is used.
2. Compare parallel computing and distributed computing for big data.
3. Write about the BASE concepts used to provide data consistency.
4. As an HR manager of a company providing big data solutions to clients, what
characteristics would you look for while recruiting a potential candidate for the
position of data analyst?
Hint:
• List the roles of the data analyst.
• List the skill set required of the analyst (such as programming and analytic skills).
• Mention some tools used by data analysts and their effect on the industry.
Unit 2:
1. List the skills required of an analyst.
2. Discuss the various points to be considered during the analysis process.
3. Describe the roles of the IT and analytics teams in a big data analytics project.
4. Explain the text mining process.