In this week lecture, the lecturer explains the techniques and development model for data mining. These issues are pretty straight forward. However, the most interesting part of the lecture is the idea of Data Mining and Knowledge Discovery. These two terms are being use interchangeably in the industry. She said that academics tend to use the term of knowledge discovery and vendors tend to use the term data mining more often. Some also said that data mining is part of knowledge discovery. I certainly agree with the latter one.
As most techniques in the data mining are not new such as decision making tree, visualisation and statistic. These techniques have been use decades, if not centuries ago by mankind. These techniques mainly aim to assist people to looks at a certain thing differently and make prediction based on previous trends. In short, it is call precedent case judgment theory. This enables people to understand the large amount of historical data by arranging those data into, for example, statistic. Thus, by applying such technique in data mining, it means that data mining is actually part of knowledge discovery.
Knowledge discovery also encapsulate more than data mining. People can discover knowledge from various sources. Before going into knowledge discovery, I will discuss what knowledge is.
Data/Artefacts + Context --> Information + General Truth --> Knowledge
I might be wrong saying how things become knowledge in such a simple diagram but let say this is how we perceived knowledge is.
As we can see, knowledge is the end product of a chain. To gain new knowledge, we must have data or artefacts that surrounds by meaningful context to become information. For instance, a number 20 would not mean anything unless it is associated with a context. Thus, we can say 20 in a classroom might means that there are 20 people in a classroom or 20 tables in a classroom. However, having information alone would not make new knowledge as information that does not associate to what other are currently exist could not put into practice. For example, 20 people in a classroom would be a piece of information. However, we would not be able to use that information unless we are told what to do or what can be done. Therefore, only information that associated with general truth can become knowledge. Therefore, we can say that knowledge would be tools or method that enables us to carry out a task appropriately.
In order to discover new knowledge, we must first discover data or artefacts. Discovering new data or artefacts can be done in different ways. For instance, walking through a park and observe what are there. Likewise, driving a golf ball and understand how far it can be hit. Some of these knowledge discovery methods are not directly associated with data mining or cannot be done in data mining. Other example that can be done using data mining would be looking at historical sales data to predict future trend. This means discovering knowledge through mathematical or systematical.
In short, data mining can be seen as part of knowledge discovery but not knowledge discovery. This is because knowledge can be gain through different methods and data mining is just one of those methods.