Abstract
The delineation of coordinates is fundamental for the
cartography of science, and accurate and credible
classification of scientific knowledge presents a
persistent challenge in this regard. We present a map of
Finnish science based on unsupervised-learning
classification, and discuss the advantages and
disadvantages of this approach vis-à-vis those generated
by human reasoning. We conclude that from theoretical and
practical perspectives there exist several challenges for
human reasoning-based classification frameworks of
scientific knowledge, as they typically try to fit
new-to-the-world knowledge into historical models of
scientific knowledge, and cannot easily be deployed for
new large-scale data sets. Automated classification
schemes, in contrast, generate classification models only
from the available text corpus, thereby identifying
credibly novel bodies of knowledge. They also lend
themselves to versatile large-scale data analysis, and
enable a range of Big Data possibilities. However, we
also argue that it is neither possible nor fruitful to
declare one or another method a superior approach in
terms of realism to classify scientific knowledge, and we
believe that the merits of each approach are dependent on
the practical objectives of analysis.
| Original language | English |
|---|---|
| Pages (from-to) | 2464-2476 |
| Journal | Journal of the Association for Information Science and Technology |
| Volume | 67 |
| Issue number | 10 |
| DOIs | |
| Publication status | Published - 2016 |
| MoE publication type | A1 Journal article-refereed |
Keywords
- machine learning
- automatic classification
- text mining
- science
Fingerprint
Dive into the research topics of 'Map of science with topic modeling: Comparison of unsupervised learning and human-assigned subject classification'. Together they form a unique fingerprint.Projects
- 2 Finished
-
RAID: Radical and Incremental Innovation in Industrial Renewal
Suominen, A. (Participant) & Toivanen, H. (PI)
1/01/15 → 31/12/16
Project: Business Finland project
-
CEK: Co-evolution of knowledge creation systems and innovation pipelines
Suominen, A. (Participant) & Toivanen, H. (PI)
1/01/13 → 31/12/14
Project: Business Finland project
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver