Select Category
All Categories
Accounting
Agriculture
Astrobiology
Banking & Finance
Business World
Civilizations
Commerce & Economics
Competitive exams
Computer Science & IT
Current affairs
Earth Science
Education
Environment
Explore the Animal World
Food chain
General Knowledge
Geography Map
Green Life
History
Human Health
Journalism
Law
Science
Social Science
TAX
About Us
Contact Us
Login
Data Mining - Part 1
1
of
25
💡
Hints:
3
Q1. Data mining is ___
A. A time variant non-Volatile collec tion of data
B. The actual discovery phase of a Knowledge
C. The stage of selecting the right data
D. None of these
Q2. In statistical distribution the outlier is identified using ___
A. working hypothesis
B. discontency test
C. alternative hypothesis
D. none of the above
Q3. ___ often used for both the prelimi nary investigation of the data and the f inal data analysis
A. Aggregation
B. Feature creation
C. Sampling
D. Attribution transformation
Q4. Ordinal is an example of ___
A. Ratio class
B. Interval class
C. Categorical class
D. Range class
Q5. k-means algorithm is sensitive to outliers
A. TRUE
B. FALSE
Q6. A collection of integrated, subject oriented databases designed to support the decision-support functions
A. Database
B. Data Collection
C. Data Warehouse
D. Data retrieval
Q7. ___ refers to the grouping of records, observations, or cases into classes of similar objects
A. Clustering
B. Grouping
C. Classification
D. Gathering
Q8. Which of the following is not involve in data mining
A. Data archaeology
B. Knowledge extraction
C. Data transformation
D. Data exploration
Q9. a data warehouse can include
A. flat-files
B. database table
C. online data
D. all
Q10. Of the following which is not a distance based clustering algorithms?
A. K-Means
B. K-Medoids
C. BIRCH
D. DBSCAN
Q11. The mass of the beaker was 122 g.
A. Qualitative Data
B. Quantitative Data
Q12. DBSCAN has a drawback over OPTICS
A. fixed sized radiius
B. fixed sized points in radiius
C. core points
D. none of above
Q13. This is a term that describes the large volume of data; both structured and unstructured.
A. Big Data
B. Big Knowledge
C. Big Information
D. Data at rest
Q14. Same person with multiple email addresses is an example of ___
A. Noise
B. Outliers
C. Missing values
D. Duplicate data
Q15. Find the median of these numbers:4,2,7,4,3
A. 2
B. 5
C. 7
D. 4
Q16. Select the tools that can be used for data mining
A. KNIME
B. WEKA
C. RATTLE
D. All
Q17. Is Logistic regression a supervised machine learning algorithm?
A. TRUE
B. FALSE
Q18. Of the following what are the distance based clustering algorithms?
A. K-Means
B. K-Medoids
C. Hierarchical
D. All
Q19. Suppose a cluster contain the points (1, 3), (3, 3), (2, 1). What is the centroid of the cluster?
A. (2, 2.33)
B. (2.33, 2)
C. (2, 3)
D. None
Q20. ___ is the output of KDD
A. Query
B. Useful Information
C. Data
D. Information
Q21. The view over an operational data warehouse is known as virtual warehouse
A. TRUE
B. FALSE
Q22. ___ refers to the mapping or classifi cation of a class with some predefined group or class.
A. Data Discrimination
B. Data Characterization
C. Data Definition
D. Data Visualization
Q23. Application server and data server are kept separately in.
A. Peer to Peer based Processing
B. Master slave based Processing
C. Host based Processing
D. 3-Tier Client Server model
Q24. Lift dari rule K ___ E adalah ___
×
A. 0.4
B. 0.6
C. 0.8
D. 1
Q25. Select the skills mainly required as a competent data analyst/scientist/miner
A. SQL
B. R
C. Java
D. All
Submitting Your Quiz...
Please wait while we process your answers