lOMoARcPSD|36673638
Data Science Multiple Choice Question
Data Science (K.L.E. Dr. M.S. Sheshgiri College of Engineering and
Technology)
Scan to open on Studocu
Downloaded by Ramya p
Studocu is not sponsored or endorsed by any college or university
Downloaded by Ramya p
Department of CS/IT
TYCS – SEM 6 - DATA SCIENCE
MCQ Model Questions
Choose the correct option:
1. Which of the following would be more appropriate to be replaced with question mark in
the following figure?
a) Data Analysis b) Data Science c) Descriptive Analytics d) Commerce
2. Which of the following is the most important language for Data Science?
a) Java b) Ruby c) R d) Basic
3. Which of the following is the common goal of statistical modelling?
a) Inference b) Summarizing c) Subsetting d) script
4-------------------------Shows all individual data points.
a) Box-plot b) Scatter Plot c) Line plot d) Pie chart
5. Xquery is a functional query language used to retrieve information stored in -----------
format.
a) HTML b) XML c) UML d) Jscript
6. Xpath specification has--------------------------types of nodes
a) Fourb) Five c) Six d) Seven
7. Data Visualization is also on element of the broader ------------------------
a) deliver presentation architecture b) data presentation architecture
c) dataset presentation architecture c) data process architecture
Downloaded by Ramya p
8. Which method shows hierarchical data in a nested format?
a) tree maps b) Scatter Plots c) Population pyramids d) Area Charts
9. Which of the following is most basic and commonly used techniques?
a) line charts b) Scatter plots c) Population pyramids d) Area charts
10. Which of the following is not a part of data science process?
a) discovery b) model planning c) communication building d) operationalize
11. In Xquery symbol preceded before the variable name.
a) @ b) $ c) # d) *
12. MongoDB support cross platform and is written in language.
a) C++ b) R c) Java d) Python
13. MongoDB is database.
a) SQLb) NoSQL c) RDBMS d) DBMS
14. Ridge Regression is when data suffers from _
a) Collinearity b) Multicollinearity c) Does not suffer d) Regression
15. Bayesian information Criterion (BIC) is related to
a) Ridge regression b) AIC c) Cross validation d) Lasso Regression
16. Joins are used for combining product.
a) Vector b) Cartesian c) Scalar d) Euler
17. Which of the following step is performed by data scientist after acquiring the data?
a) Data Cleansing b) Data Integration c) Data Replication d) Deletion
Downloaded by Ramya p
18. Which of the following package is used for reading excel data?
a) xlsx b) xlsc c) read.sheet d)VB
19. Which of the following is another name for raw data?
a) destination data b) eggy data c) secondary d) Machine Learning
20. Arranging the customers names in ascending order is an example of
a) process b) information processing c) process d) information
21. Organisation, distribution and manipulation of information is classified as
a) data manipulating b) process selection
c) information extraction d) information processing
22. Quantitative data deals with _
a) numbers and things b) Characteristics c) images d) sketches
23. Qualitative data deals with
a) Characteristics b) numbers c) things d) price
24. Example for discrete data
a) The number of children b) height of children
c) weight of children d) behaviour of children
25. Primary data is
a) Collected for the first time b) Collected for the second time
c) Not original data d) statistical operations have been performed.
26. The use of tabular data and graphs and charts makes it to understand the
concept of bar charts and histograms.
a) easy b) difficult c) boring d) confusing
Downloaded by Ramya p
27. This language was developed by Dennis Ritchie of Bell Laboratories in order to
implement the operating system UNIX.
a) C b) C++ c) Java d) LISP
28. Computer programs are written in a high level programming language; however, the
human-readable version of a program is called ………….
a) cache b) Instruction set c) source code d) word size
29. Query language comes under:
a) Third generation b) Fourth generation c) Fifth generation d) First Generation
30. Bitmapped file formats can be most useful for
a) Plots that may need to be resized
b) Plots that require animation or interactivity
c) Plots that are not scaled to a specific resolution
d) Scatterplots with many many points
31. The stem and leaf displaying technique is used to present data in
a) descriptive data analysis b) exploratory data analysis
c) nominal data analysis d) ordinal data analysis
32. Example for semi structured data_
a) XML data b) Relational data c) media logs d) word
33. Example for Unstructured data
a) media logs b) XML data c) Relational data d) Oracle
34. Which of the following is not a NoSQL database?
a) SQL Server b) MongoDB c) Cassandra d) C
Downloaded by Ramya p
35. Which of the following is a NoSQL Database Type?
a) SQLb) Document Database c) JSON d) C++
36. NoSQL databases is used mainly for handling large volumes of data.
a) unstructured b) structured c) semi-structured d) images
37. The government and non government publications are considered as
a) external secondary data sources b) internal secondary data sources
c) external primary data sources d) internal primary data sources
38. Amazon web services falls into which of the following cloud-computing category?
a) Platform as a Service b) Software as a Service
c) Infrastructure as a Service d) Back-end as a Service
39. The _ is a symbolic representation of facts or ideas from which information can
potentially be extracted.
a) knowledge b) data c) algorithm d) program
40. Data mining is used to refer stage in knowledge discovery in database.
a) Selection b) retrieving c) discovery d) coding
41. A collection of interesting and useful patterns in database is called _ .
a) knowledge b) information c) data d) algorithm
42. analysis divides data into groups that are meaningful, useful, or both.
a) cluster b) text c) multimedia d) link
Downloaded by Ramya p
43. Data dictionary is
a) Large collection of data mostly stored in a computer system
b) The removal of noise errors and incorrect input from a database
c) The systematic description of the syntactic structure of a specific database. It describes the
structure of the attributes the tables and foreign key relationships.
d) image
44. Data cleaning is
a) Large collection of data mostly stored in a computer system
b) The removal of noise errors and incorrect input from a database
c) The systematic description of the syntactic structure of a specific database. It describes the
structure of the attributes the tables and foreign key relationships.
d) Decision support systems
45. E-R model uses this symbol to represent weak entity set?
a) Dotted rectangle b) Diamond c) Doubly outlined rectangle d) Square
46. Relational Algebra is
a) Data Definition Language b) Meta Language
c) Procedural query Language d) BASIC
47. What is a relationship called when it is maintained between two entities?
a) Unary b) Binary c) Ternary d)Quaternary
48. The RDBMS terminology for a row is
a) Tuple b) Relation c) Attribute d) Degree
49. CouchDB is
a) Document-oriented DBMS b) Relational DBMS
Downloaded by Ramya p
c) Compiler d) Interpreter
50. can be used for batch processing of data and aggregation operations.
a) Hive b) MapReduce c) Oozie d) PASCAL
Downloaded by Ramya p