Question 10
[6 Marks]
What are the best qualities to assessthe quality of data?
Question 11
[2 Marks]
What would you discover when performing Frequent ltemset Mining?
Question 12
[4 Marks]
Explain the difference between Data Cleaning and Data Integration.
Question 13
[6 Marks]
Outliers are often discarded as noise. However, one person's garbage could be another's
treasure. For example, exceptions in credit card transactions can help us detect the fraudulent
use of credit cards. Using fraudulence detection as an example, propose two methods, which
can be used to detect outliers and discuss which one is more reliable.
Question 14
[2 Marks]
Operational databases store huge amounts of data, and you may wonder, "Why not perform
on line analytical processing directly on such databases instead of spending additional time and
resources to construct a separate data warehouse?". What would the reason be for this?
Question 15
[5 Mark]
You are an experienced Data Miner who applies the disciplines of Data Science and Philosophy
together in tackling your daily duties. A junior Data Miner wants to understand the concept of
Data Science to which you are applying. How or what would you explain to the junior Data
Miner?
Question 16
[3 Marks]
You just joined a Data Mining Consulting Company as an Intern for five (5) months. During the
last week of your internship program, your supervisor asks you to explain the concept of
attribute construction during preprocessing. Please provide an example with the explanation.
Question 17
[12 Marks]
You are employed at Tech Mining Consultants as a Mining Engineer. Part of your duties is to
teach Data Mining Interns data mining concepts. During a meeting, one intern wants you to list
the most important data mining techniques. What are the different techniques used for data
mining? Please list and explain them.
3