Important Questions of Data Mining

Unit 1.

  1. Define data mining. Explain the applications of data mining.
  2. Explain the KDD (Knowledge Discovery in Databases) process with a diagram.
  3. Explain the data mining system architecture.

Unit 2.

  1. Explain data mining functionalities.
  2. Explain the OLAP operations:
    • Roll up (drill up)
    • Drill down (roll down)
    • Slice
    • Dice
    • Pivot
  3. Write about data warehouse schemas:
    • Star schema: characteristics, advantages, disadvantages
    • Snowflake schema: characteristics, advantages, disadvantages
    • Galaxy schema: characteristics, advantages, disadvantages
  4. Define data warehouse architecture.
  5. Difference between a database and a data warehouse.
  6. Difference between the star schema and the snowflake schema.

Unit 3.

  1. Explain data preprocessing and why it is important:
    • Data cleaning
    • Data integration
    • Data transformation
    • Data reduction
  2. What is discretization? Explain concept hierarchy generation.

Unit 4.

  1. Define clustering. Explain the K-means algorithm with an example.
  2. Explain the K-medoids algorithm with an example.
  3. Difference between the agglomerative and divisive approaches to hierarchical clustering.

Unit 5.

  1. Difference between classification and prediction.
  2. Explain the Naïve Bayes algorithm. How does it work?
  3. Explain the concepts of linear regression and non-linear regression.

Unit 6.

  1. Write about the Apriori algorithm.
    • Frequent itemset patterns
  2. Generating association rules from frequent itemsets.
    • FP-Growth
  3. Difference between the FP-Growth algorithm and the Apriori algorithm.

Unit 7.

  1. What is an IR (information retrieval) model?
  2. What are the components of an IR model?
  3. Difference between information retrieval and data retrieval.
  4. Define image retrieval.
  5. Define video retrieval.
