Data: (with Big Data and Database Semantics)
Research output: Contribution to journal › Journal article › Research › peer-review
Standard
Data : (with Big Data and Database Semantics). / Hjørland, Birger.
In: Knowledge Organization, Vol. 45, No. 8, 12.2018, p. 685-708.Research output: Contribution to journal › Journal article › Research › peer-review
Harvard
APA
Vancouver
Author
Bibtex
}
RIS
TY - JOUR
T1 - Data
T2 - (with Big Data and Database Semantics)
AU - Hjørland, Birger
N1 - Publikationen er også publiceret online: http://www.isko.org/cyclo/data
PY - 2018/12
Y1 - 2018/12
N2 - It is argued that data should be defined as information on properties of units of analysis. Epistemologically it is important to establish that what is considered data by somebody need not be data for somebody else. This article considers the nature of data and “big data” and the relation between data, information, knowledge and documents. It is common for all these concepts that they are about phenomena produced in specific contexts for specific purposes and may be represented in documents, including as representations in databases. In that process, they are taken out of their original contexts and put into new ones and thereby data loses some of or all their meaning due to the principle of semantic holism. Some of this lost meaning should be reestablished in the databases and the representations of data/documents cannot be understood as a neutral activity, but as an activity supporting the overall goal implicit in establishing the database. Utilizing (big) data (as it is the case with utilizing information, knowledge and documents) demands first of all the identification of the potentials of these data for relevant purposes. The most fruitful theoretical frame for knowledge organization and data science is the social epistemology suggested by Shera (1951). One important aspect about big data is that they are often unintentional traces we leave during all kinds of activities. Their potential to inform somebody about something is therefore less direct as compared to data that have been produced intentionally as, for example, scientific databases.
AB - It is argued that data should be defined as information on properties of units of analysis. Epistemologically it is important to establish that what is considered data by somebody need not be data for somebody else. This article considers the nature of data and “big data” and the relation between data, information, knowledge and documents. It is common for all these concepts that they are about phenomena produced in specific contexts for specific purposes and may be represented in documents, including as representations in databases. In that process, they are taken out of their original contexts and put into new ones and thereby data loses some of or all their meaning due to the principle of semantic holism. Some of this lost meaning should be reestablished in the databases and the representations of data/documents cannot be understood as a neutral activity, but as an activity supporting the overall goal implicit in establishing the database. Utilizing (big) data (as it is the case with utilizing information, knowledge and documents) demands first of all the identification of the potentials of these data for relevant purposes. The most fruitful theoretical frame for knowledge organization and data science is the social epistemology suggested by Shera (1951). One important aspect about big data is that they are often unintentional traces we leave during all kinds of activities. Their potential to inform somebody about something is therefore less direct as compared to data that have been produced intentionally as, for example, scientific databases.
U2 - 10.5771/0943-7444-2018-8-685
DO - 10.5771/0943-7444-2018-8-685
M3 - Journal article
VL - 45
SP - 685
EP - 708
JO - Knowledge Organization
JF - Knowledge Organization
SN - 0943-7444
IS - 8
ER -
ID: 209653225