|
1 |
What is Normalization? Explain various methods for data normalization. |
|
2 |
What is Data Integration? |
|
3 |
Why do we pre-process the
data? |
|
4 |
What are the steps involved
in data pre-processing? |
|
5 |
Discuss the issues to be considered during data
integration? |
|
6 |
Describe the different methods for data cleaning. |
|
7 |
Explain how to handle noisy data? |
|
8 |
Explain Data smoothing and binning methods for data
smoothing. |
|
9 |
What is meant by dimensionality reduction? Discuss any
2 methods. |
|
10 |
What do you mean by numerosity reduction? Explain
various methods to achieve it. |
|
11 |
Briefly explain methods of concept hierarchy generation
for categorical data. |
|
12 |
Explain sampling methods for data reduction. |
|
13 |
What is Binning? List and
explain binning strategies. |
|
14 |
What is Data transformation? Briefly explain
various steps of data transformation. |
|
15 |
Explain Data cleaning as
two-step process. |
|
16 |
Explain the purpose of
correlation analysis. Explain how to find correlation between two numeric
attributes and categorical attributes. |
|
17 |
In real-world data, tuples
with missing values for some attributes are a Common occurrence. Describe
various methods for handling this problem. |
|
18 |
List out strategies for data
reduction. |
|
19 |
Explain data cube aggregation
of data reduction. |
|
20 |
Explain Attribute subset
selection technique of data reduction. |
|
21 |
What is Discretization? Why it is used? Explain types
of discretization. |
|
22 |
What
is concept Hierarchy? |
|
23 |
Explain methods for constructing concept hierarchy for
numeric attribute based on data discretization. |
|
24 |
Explain methods for constructing concept hierarchy for
categorical attribute based on data discretization. |
|
25 |
Define the following terms: Mean, Median, Mode, Range, Five Number Summary, Inter
Quartile Range, Variance, Standard Deviation, Outlier, kth
Percentile |
|
26 |
Explain different Graphics display methods of basic
descriptive data summaries : Box Plot, Histogram, Scatter Plot, Quantile
plot, Quantile-quantile plot, Loess Curve |
Unit:1
|
27 |
Differentiate between data, information &
knowledge. |
|
28 |
Define Business Intelligence. |
|
29 |
Define Data Warehouse. |
|
30 |
Explain common functions of Business Intelligence
technologies. |
|
31 |
What is the relation between Data warehouse and BI. |
|
32 |
Explain components and elements of data warehouse. |
|
33 |
Explain components and elements of business
intelligence. |
|
34 |
Explain life cycle of data. |
|
35 |
Explain Data warehouse metadata. |
|
36 |
Explain various trends in data warehousing. |
Unit 2
|
37 |
Why have a separate data
warehouse from operational databases? |
|
38 |
Explain data mart. |
|
39 |
Differentiate data
warehouse and data mart. |
|
40 |
Differentiate
Operational Systems vs. Decision Support System(Informational system). |
|
41 |
What is Virtual Warehouse? |
|
42 |
List the types of OLAP server. |
|
43 |
Which one is faster, Multidimensional OLAP or Relational OLAP? |
|
44 |
How many dimensions are selected in Slice operation? |
|
45 |
How many dimensions are selected in dice operation? |
|
46 |
How many fact tables are there in a star schema? |
|
47 |
Explain types of data warehouse. (information processing, analytical processing, data
mining) |
|
48 |
Explain two approaches for integrating heterogenous databases?
(query-driven, update-driven) |
|
49 |
Explain three-tier data warehouse architecture. |
|
50 |
Explain Data Warehouse Models. (Virtual data warehouse, data mart,
enterprise warehouse) |
|
51 |
What is the difference
between dependent data warehouse and independent data warehouse? |
|
52 |
Briefly state different
between data ware house & data mart? |
|
53 |
What is
the benefit of data warehouse? |
|
54 |
Explain
the storage models of OLAP. |
|
55 |
Differentiate between
Data Mining and Data warehousing. |
|
56 |
Differentiate between Data warehousing and Business Intelligence. |
|
57 |
What is Data purging? |
|
58 |
What is Data scrubbing? |
|
59 |
What are CUBES? |
|
60 |
Differentiate between OLTP and OLAP. |
|
61 |
What is
the very basic difference between data warehouse and operational databases? |
|
62 |
How does a Data Cube help? |
|
63 |
Define dimension? |
|
64 |
What does Metadata Respiratory contain? |
|
65 |
Define metadata. |
|
66 |
What do you mean by Data Extraction? |
|
67 |
List the Schema that a data warehouse system can implements. |
|
68 |
List the functions of data warehouse tools and utilities. |
|
69 |
List the processes that are involved in Data Warehousing. |
|
70 |
What is Data Warehousing? |
|
71 |
What are different types of
cuboids? |
|
72 |
What are the forms of
multidimensional model? |
|
73 |
If there are n dimensions,
how many cuboids are there? |
|
74 |
List the typical OLAP
operations. |
|
75 |
Differentiate
between star schema and snowflake schema. |
|
76 |
What
is a fact table? |
|
77 |
What
is a dimension table? |
|
78 |
What
is a ETL process? |
|
79 |
What is aggregation? |
|
80 |
Explain methods for
indexing OLAP data. |
|
81 |
Define Apex cuboid, Base
cuboid. |
|
82 |
Explain starnet query model. |
|
83 |
Explain pros and cons of
top-down and bottom-up approaches for data warehouse development. |
|
84 |
How many cuboids will be
there in n-dimensional cube? |
|
85 |
Explain data cube
materialization. |
|
86 |
Explain Online analytical
mining. |
Unit 3
|
87 |
What are issues in data
mining? |
|
88 |
What are
the different problems that “Data mining” can solve? |
|
89 |
What is
Discrete and Continuous data in Data mining world? |
|
90 |
How does
the data mining and data warehousing work together? |
|
91 |
What is data
characterization? |
|
92 |
What is data
discrimination? |
|
93 |
What are two types of data
mining tasks? (Descriptive task,Predictive task) |
|
94 |
What are outliers? |
|
95 |
What do you mean by
evolution analysis? |
|
96 |
What do you mean by Time
Series analysis? |
|
97 |
What is Association Mining? |
|
98 |
What are the components of
data mining? |
|
99 |
What are data mining
techniques/functionalities? |
|
100 |
Define KDD. |
|
101 |
What is the use of Knowledge Base? |
|
102 |
Give the architecture of data mining system. |
|
103 |
Discuss the issues in data mining in detail. |
|
104 |
Describe the steps involved in KDD process. |
|
105 |
Discuss data
mining task primitives. |
|
106 |
Explain various data repositories on which data mining
techniques are applied. |
|
107 |
Explain architecture of data mining systems along with
components in architecture of data mining system. |
|
108 |
Describe multi-dimensional view of data mining
classification. |
|
109 |
Explain types of integration of data mining system with
DBMS or data warehouse system. |
Concept Description and Association Rule Mining
|
110 |
What are frequent patterns? |
|
111 |
What
is concept Hierarchy? |
|
112 |
Explain the Apriori algorithm. Also explain how the
association rules are generated from frequent item sets. |
|
113 |
What do you mean by closed frequent item set? What is
its application? Which are various searching methods for
it? |
|
114 |
Discuss why analytical data characterization is needed
and how it can be performed. Compare the result of two induction methods. 1) With relevance Analysis 2) Without relevance Analysis |
|
115 |
Explain different approaches
to mining multilevel association rules. |
|
116 |
Explain Market Basket
Analysis. |
|
117 |
Explain measures for
finding rule interestingness. (support, confidence) |
|
118 |
Explain various ways of
classifying frequent pattern mining. |
|
119 |
Explain methods for
improving the efficiency of Apriori
algorithm. |
Classification and Prediction
|
120 |
What is regression? |
|
121 |
Define classification. |
|
122 |
How do you choose best split while constructing a
decision tree? |
|
123 |
Explain the algorithm for constructing a decision tree
from training samples. |
|
124 |
Write Bayes theorem. |
|
125 |
Compare clustering and classification. |
|
126 |
Differentiate supervised and unsupervised learning. |
|
127 |
Explain machine learning. |
|
128 |
What is prediction? Discuss the use of regression
techniques for prediction? |
|
129 |
Compare association and classification. Briefly explain
associative classification with suitable example. |
|
130 |
Compare various attribute selection measures for
decision tree with suitable example. |
|
131 |
Define: supervised learning, training set, testing set,
accuracy of classifier, sensitivity, specificity, regression. |
|
132 |
Explain various methods of evaluating accuracy of
classifier. |
|
133 |
Why naïve Bayesian
classification is called “naïve”? Briefly outline the major idea of naïve
Bayesian classification. |
|
134 |
Write down short note on
Backpropagation |
|
135 |
Explain issues regarding
classification and prediction. (Preparing data for classification &
prediction, Comparing classification and prediction methods) |
|
136 |
Explain criteria according to
which classification and prediction methods can be compared? |
|
137 |
Why decision tree classifiers
are so popular? |
Data Mining for Business Intelligence Applications
|
138 |
Explain data mining application for balanced scorecard. |
|
139 |
Explain data mining application for fraud detection. |
|
140 |
Explain data mining application for Click stream mining. |
|
141 |
Explain data mining application for Market
Segmentation. |
|
142 |
Explain data mining application for retail industry. |
|
143 |
Explain data mining application for telecommunication
industry. |
|
144 |
Explain data mining application for banking and
finance. |
|
145 |
Explain data mining application for CRM. |
|
146 |
Explain data analytics life cycle. |
|
147 |
State of the practice in analytics role of data scientists |
|
148 |
What is spatial data
mining? |
|
149 |
What is multimedia data
mining? |
|
150 |
What are different types of
multimedia data? |
|
151 |
What is text mining? |
|
152 |
What do you mean by web
content mining? |
|
153 |
Define web structure mining
and web usage mining. |
|
154 |
Explain clustering. Explain Various methods for
clustering. |
|
155 |
Define big data. |
|
156 |
Explain distributed file system. |
|
157 |
Explain big data applications. |
|
158 |
Explain Hadoop Architecture. |
|
159 |
Explain algorithm for map
reduce. Solve Matrix-Vector Multiplication by Map Reduce. |
|
160 |
Explain Hadoop storage –
HDFS. |
No comments:
Post a Comment