
Number: 12

Linked Data
for Electronic Theses and
Dissertations of
the Thai Academic Digital Collection

Yuttana J.
Ph.D. Student, Doctor of Philosophy program in Information Studies

22/05/2019

Last updated, 1st semester of academic year 2019


Agenda
Quick stories of TDC

Research problems

Current progress

Future Directions

Background
Intellectual Property of Thailand Universities

Thailand Academic Digital Collections

Information Technology + Digital Media & Storage + Networking

Transformation of information resources

Digital Collections categorized by institutes, area of study and object types
About Thai Academic Digital Collection

founded by the Commission of Higher Education of Thailand (CHE) in 2000

managed by members of the ThaiLIS (Thailand Integrated Library System consortium)

to support the academic libraries in creating digital collections of the institutional publications
TDC Timeline & members

2000: DCMS initiative (24)
2004: TDC (114 universities)
2012: CU e-thesis integrated
2017: TDC Phase 2 (more than 174 institutes)
Research problems

OPERATIONS
the operation problems: acquisitions, digitization, metadata, right management and IR

DISCOVERY
lack of tools for information discovery

DEAD LINK
lack of accurate links between digital objects

INFORMATION RETRIEVAL
the main problem is IR, which directly affects the end users

(Schultz, 2014; NDLTD, 2015; ThaiLIS, 2015)
Why Linked Data?

01 DATA EXCHANGE
Linked data is a generic, native way of data exchange.

02 REPOSITORIES
Not limited to the field of repositories.

03 SELF DESCRIPTIVE
Data published following the Linked Data Principles is self-descriptive.

04 INTEROPERABILITY
Simplifies data exchange with repositories for everyone outside of the repository environment.

05 SEMANTIC NETWORK
SPARQL endpoints allow searches within the contents of external repositories.

06 WEB STANDARD
Uses standard web technology for implementation.

(Miiler, 2002; Schilling, 2012; Candela, 2015)
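The self-descriptive point can be illustrated with a small sketch. The Python below is only an illustration, not the project's implementation: it writes one ETD record as N-Triples using Dublin Core Terms properties. The `http://example.org/tdc/etd/` namespace, the function names and the sample record are assumptions made for this example.

```python
# Minimal sketch: describe one ETD record as RDF triples in N-Triples syntax.
# The example.org namespace and the sample record are hypothetical;
# the Dublin Core Terms property URIs are the published ones.

DCTERMS = "http://purl.org/dc/terms/"
BASE = "http://example.org/tdc/etd/"  # hypothetical namespace


def literal(value: str) -> str:
    """Quote a string as an N-Triples literal, escaping backslash, quote and line breaks."""
    escaped = (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )
    return f'"{escaped}"'


def etd_to_ntriples(record_id: str, title: str, creator: str, issued: str) -> list[str]:
    """Return one N-Triples line per property of the record."""
    subject = f"<{BASE}{record_id}>"
    return [
        f"{subject} <{DCTERMS}title> {literal(title)} .",
        f"{subject} <{DCTERMS}creator> {literal(creator)} .",
        f"{subject} <{DCTERMS}issued> {literal(issued)} .",
    ]


if __name__ == "__main__":
    for line in etd_to_ntriples("0001", "An example thesis title", "Example Author", "2019"):
        print(line)
```

Because every property is itself a URI, a consumer outside the repository can look up what `title` or `creator` means without any knowledge of the TDC database schema.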
Research Objectives

Step 01: Life-cycle management
1. To identify the current situation of the life cycle management of digital collections in Thai universities

Step 02: ETD Datasets
2. To design the dataset structure for the electronic theses and dissertations (ETD) collections of TDC

Step 03: Linked Data of TDC
3. To develop Linked Data for the ETD collections of TDC

Expected Outcomes
Linked Data for Thai Academic Digital Collections with a 5-star bibliographic linked data rank that can contribute systematically and effectively.
Research framework

1. Current conditions of ETD life cycle management of TDC
- Project Initiative
- Creation, Submit & Ingest
- Access & Retrieve
- Archiving & Preservation
- Evaluation & Assessment

TDC issues & requirements
- Acquisition & Gathering
- Digitization
- Metadata standards
- Right Management
- Storage & Retrieval
- Content Management
- User interface
- Inter-operation

2. Analyze and design for dataset structure of TDC (TDC frameworks)
- NISO Digital Collection Principles
- Digital Libraries reference models
- Information Life Cycle Management
- 5-star bibliographic linked data
- SKOS
- OWL & SPARQL

3. Linked Data for Thai Academic Digital Collections
- controlled vocabularies
- machine-readable, explicit representation of the vocabulary
- vocabulary is linked to other vocabularies
- metadata about the vocabulary is available
- vocabulary is linked to by other vocabularies

Linked Data Evaluation
- RDF & ingestion engine
- Search & Browse
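The five vocabulary criteria listed under item 3 of the framework can be read as a checklist. The sketch below is one possible way to turn them into a star count. Treating the criteria as cumulative (a star counts only if all earlier ones are met) is an assumption of this example, as are the class and field names; the slides do not define how the rank is computed.

```python
from dataclasses import dataclass

# Criteria in the order they appear in the framework. Reading them as a
# cumulative star scale is an assumption of this sketch.
CRITERIA = (
    "controlled_vocabulary",
    "machine_readable",
    "links_to_other_vocabularies",
    "metadata_available",
    "linked_to_by_other_vocabularies",
)


@dataclass
class VocabularyProfile:
    """Hypothetical self-assessment of one vocabulary used by a dataset."""
    controlled_vocabulary: bool = False
    machine_readable: bool = False
    links_to_other_vocabularies: bool = False
    metadata_available: bool = False
    linked_to_by_other_vocabularies: bool = False


def star_rank(profile: VocabularyProfile) -> int:
    """Count criteria met in order, stopping at the first one that fails."""
    stars = 0
    for name in CRITERIA:
        if not getattr(profile, name):
            break
        stars += 1
    return stars


if __name__ == "__main__":
    profile = VocabularyProfile(True, True, True, False, False)
    print(star_rank(profile))
```

A non-cumulative count (simply summing the criteria met) would be an equally defensible reading; the cumulative form was chosen here because it makes gaps at the lower levels visible.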
Research Objective 1

TDC requirements & issues

Program planning
- No policies / authority
- Institute strategic planning
- Budgets
- Role of ETDs committee
- Lack of awareness

Create & Ingest
- Tools & checklist online
- Web services
- Academic library's role
- Right management
- ETD metadata standard

Access & Retrieve
- Retrospectively reformatted
- Web OPAC linkage
- Search engine friendly
- System inter-operability

Preservation
- Objects preservation & curation
- Information organization & classification problems
- Master & delivery files

Evaluation
- No standard indicator or KPI for evaluation
- Only for service, no evaluation process for infrastructure or whole program

Co-operation
- Network infrastructures
- Main software for consortium
- Administration & advisory
- Copyright
- Authority & budget
Research output 1

ETD program planning
(1) Providing a rationale for the ETD program
(2) Proposing budgets & support
(3) Proposing program implementation

ETD life-cycle management
Research Objective 2

Research output 2-01

The core system from M.D.O.E.N requirements
Research output 2-02

Metadata schemas and metadata standards for ETD indexes and digital object management
Research output 2-03

1. 71,992 records of electronic theses and dissertations datasets from 24 universities in Thailand

2. 73,270 records of all authors in the database (personal names and corporate names; conference names not yet included)

3. 59,309 records of all keywords in the database (not yet categorized according to the Faceted Application of Subject Terminology, FAST)

Author & Subject records
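As a rough illustration of how author and keyword records might be derived from the bibliographic records, the sketch below collapses name and keyword variants that differ only in case or spacing. The class names, the field names and the normalization rule are assumptions for this example and do not describe the actual TDC database; the sample names are invented.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Author:
    name: str
    kind: str  # "personal" or "corporate"


@dataclass
class EtdRecord:
    record_id: str
    title: str
    authors: list[Author] = field(default_factory=list)
    keywords: list[str] = field(default_factory=list)


def normalize(text: str) -> str:
    """Collapse whitespace and ignore case so trivial variants compare equal."""
    return " ".join(text.split()).casefold()


def build_authority_tables(records: list[EtdRecord]) -> tuple[list[Author], list[str]]:
    """Return unique authors and keywords, keeping the first spelling seen."""
    authors: dict[tuple[str, str], Author] = {}
    keywords: dict[str, str] = {}
    for record in records:
        for author in record.authors:
            authors.setdefault((normalize(author.name), author.kind), author)
        for keyword in record.keywords:
            keywords.setdefault(normalize(keyword), keyword)
    return list(authors.values()), list(keywords.values())


if __name__ == "__main__":
    sample = [
        EtdRecord("1", "First thesis", [Author("Somchai  J.", "personal")], ["Linked Data"]),
        EtdRecord("2", "Second thesis", [Author("somchai j.", "personal")], ["linked data"]),
    ]
    unique_authors, unique_keywords = build_authority_tables(sample)
    print(len(unique_authors), len(unique_keywords))
```

Real authority control would need more than string normalization (for example, distinguishing different people with the same name), so this should be read as a starting point only.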


Research output 2-04

ETDs bibliographic datasets


Agenda
What is the TDC?

How did I start?

What am I doing?

Future Directions

Research Objective 3

Research output 3-01

Analyze the Linked Data schema using ontologies as tools


Research output 3-02

Use word lists and SKOS from the Bibliographic Ontology


Research output 3-04

Use word lists and SKOS from other Linked Data sources
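One way a keyword from the word list could be expressed in SKOS and linked to an external vocabulary is sketched below. The SKOS and RDF URIs are the W3C ones; the `http://example.org/` URIs, the function name and the sample keyword are placeholders assumed for this example, not identifiers used by TDC.

```python
from typing import Optional

# Minimal sketch: express a keyword as a SKOS concept and link it to a
# matching concept in another vocabulary. All example.org URIs are hypothetical.

SKOS = "http://www.w3.org/2004/02/skos/core#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
BASE = "http://example.org/tdc/keyword/"  # hypothetical namespace


def concept_triples(
    concept_id: str,
    labels: dict[str, str],
    exact_match: Optional[str] = None,
) -> list[str]:
    """Return N-Triples lines for one concept: type, labels, optional external link."""
    subject = f"<{BASE}{concept_id}>"
    triples = [f"{subject} <{RDF_TYPE}> <{SKOS}Concept> ."]
    for lang, label in sorted(labels.items()):
        escaped = label.replace("\\", "\\\\").replace('"', '\\"')
        triples.append(f'{subject} <{SKOS}prefLabel> "{escaped}"@{lang} .')
    if exact_match:
        # skos:exactMatch asserts that both URIs denote the same concept.
        triples.append(f"{subject} <{SKOS}exactMatch> <{exact_match}> .")
    return triples


if __name__ == "__main__":
    for line in concept_triples(
        "linked-data",
        {"en": "Linked data"},
        exact_match="http://example.org/external/linked-data",
    ):
        print(line)
```

The outgoing `skos:exactMatch` link is what connects the local word list to other Linked Data; a weaker relation such as `skos:closeMatch` may be more appropriate when the two concepts only approximately coincide.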


Future Directions

Progress: 100%, 100%, 95%, 60%
Status: Complete, In progress, In progress, N/A

Current Conditions: Policies, Issues, Requirements
System Specifications: Functional Requirements, Databases
ETDs Data Sets: Acquisition, Digitization, Metadata, Right Management, Retrieval
Linked Data: 8 Types of FAST
Goals
Future directions

1. The frameworks of academic digital collections management for Thai universities
2. A system prototype that includes datasets and data structures for the TDC Linked Data
3. Linked Data for Thai Academic Digital Collections with a 5-star bibliographic linked data rank that can contribute systematically and effectively. In addition, it is a tool for gathering, accessing, using and preserving the knowledge in the Thai Academic Digital Collection.
Thank you!