Professional Documents
Culture Documents
Abstract—We present an approach and a system to let tutors monitor several important aspects related to online tests, such as
learner behavior and test quality. The approach includes the logging of important data related to learner interaction with the system
during the execution of online tests and exploits data visualization to highlight information useful to let tutors review and improve the
whole assessment process. We have focused on the discovery of behavioral patterns of learners and conceptual relationships among
test items. Furthermore, we have led several experiments in our faculty in order to assess the whole approach. In particular, by
analyzing the data visualization charts, we have detected several previously unknown test strategies used by the learners. Last, we
have detected several correlations among questions, which gave us useful feedbacks on the test quality.
Index Terms—Distance learning, data visualization, interactive data exploration and knowledge discovery.
1 INTRODUCTION
Authorized licensed use limited to: TAGORE ENGINEERING COLLEGE. Downloaded on June 4, 2009 at 03:52 from IEEE Xplore. Restrictions apply.
COSTAGLIOLA ET AL.: MONITORING ONLINE TESTS THROUGH DATA VISUALIZATION 775
Authorized licensed use limited to: TAGORE ENGINEERING COLLEGE. Downloaded on June 4, 2009 at 03:52 from IEEE Xplore. Restrictions apply.
776 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 21, NO. 6, JUNE 2009
Authorized licensed use limited to: TAGORE ENGINEERING COLLEGE. Downloaded on June 4, 2009 at 03:52 from IEEE Xplore. Restrictions apply.
COSTAGLIOLA ET AL.: MONITORING ONLINE TESTS THROUGH DATA VISUALIZATION 777
Fig. 5. Basic elements of the chart. (a) Wrong response. (b) Right
response. (c) Browsing event. (d) Item view.
Authorized licensed use limited to: TAGORE ENGINEERING COLLEGE. Downloaded on June 4, 2009 at 03:52 from IEEE Xplore. Restrictions apply.
778 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 21, NO. 6, JUNE 2009
Authorized licensed use limited to: TAGORE ENGINEERING COLLEGE. Downloaded on June 4, 2009 at 03:52 from IEEE Xplore. Restrictions apply.
COSTAGLIOLA ET AL.: MONITORING ONLINE TESTS THROUGH DATA VISUALIZATION 779
space necessary for the storage of the images: for caching TABLE 1
purposes, only images obtained within the browser session Extract of the Data Matrix and Results
are saved on the server and are deleted at the end of the of the Response Changing Analysis
session. With this approach, the computation necessary to
construct each visualized chart is executed once per test
analysis session, even when the same chart is visualized
more times during the same session.
5 EXPERIMENTAL RESULTS
In order to demonstrate the effectiveness of the approach
and of the proposed system, we have used them in the
context of the Web Development Technologies course in our
faculty: the eWorkbook system [12], equipped with the new
module for tracking the learners’ interactions, has been
used to administer an online test to learners. They have
not been informed of the experiment; they just knew that experiment focuses on the strategies used by several
the grade obtained on the tests concurred to determine the learners with different abilities (“A,” “C,” and “F” students)
final grade of the course exam. The test, containing a set of to complete the tests [5]. In particular, the frequency of
25 items to be completed in a maximum of 20 minutes, several habits has been recorded and then correlated to the
was administered to 71 learners, who took it concurrently learners’ final mark on the test, such as the habit of giving a
in the same laboratory. The assessment strategy did not rash answer to the question as soon as a plausible option is
prescribe penalties for incorrect responses, and the detected, often neglecting to consider further options; the
learners were aware of that. The logger was enabled, habit of initially skipping questions whose answer is more
and an approximately 4-Mbyte-sized XML log file was uncertain, in order to evaluate them subsequently; and so
produced. The logging activity produced no visible system on. The above experiments only consider some aspects of
performance degradation. the test execution. We have found no study in the literature
that considers the learner behavior as a whole. By using our
Then, the Log Analyzer has been used for analyzing the
approach, we have replicated several experiments per-
logs in order to extract information from them and to
formed for traditional testing in an online fashion. In
graphically represent it in order to trigger a visual data
particular, we have analyzed the effects of response
mining process where the tutor plays a central role. In the changing and question skipping. More importantly, we
case of the mentioned experiments, the visual analysis of have tried to put these aspects together, obtaining a more
charts enabled a tutor to infer interesting conclusions about general analysis of the strategies used by the learners to
both the strategies the learners used to complete tests and complete the tests. As for response changing, we have
the correlation between questions. In the former case, the obtained the average number of response changes, includ-
objective was not only to understand the learners’ strategies ing wrong-to-right and right-to-wrong ones, and their
but also to detect the most successful of them under the correlation with the final score. The results obtained in
circumstances explained above and, possibly, to provide our experiments are summarized in Table 1. The table
learners with advice on how to perform better next time. On reports the following measures:
the other hand, question correlation analysis aims to
improve the final quality of the test by avoiding the . UserName. This measure refers to the name used by
composition of tests with related questions. the e-testing system to uniquely identify a learner.
. R2W. This measure refers to the number or right-to-
5.1 Strategies for Executing Tests wrong changes made by the learner during a test
Some of the experiments found in literature, aiming to session.
analyze learner’s behavior during tests, have regarded the . W2R. This measure refers to the number of wrong-to-
monitoring of the learner’s habit to verify the given right changes made by the learner during a test
answers and to change them with other more plausible session.
options [1], [2]. These experiments try to answer the . NOC. This measure refers to the total number of
following two questions: changes made by the learner during a test session.
This number also includes wrong-to-wrong changes
. Is it better for learners to trust their first impression not considered in the previous measures.
or to go back for evaluating the items again and, . Score. This measure refers to the number of questions
eventually, for changing the given answers? correctly answered by the learner during the test
. Are the strongest or the weakest learners those who session (out of 25).
change ideas more frequently? Even in our experiment, wrong-to-right changes are slightly
In the experiments, right-to-wrong and wrong-to-right more frequent than right-to-wrong ones, as shown by the
changes have been recorded, and their numbers have been average values for these measures.
correlated to the learners’ final mark on the test. Similar As for question skipping, we have analyzed its
experiments have been carried out by correlating the time to correlation with the final score: we have divided the
complete the test with the final mark [3], [4]. An interesting learners participating in the experiment into three groups
Authorized licensed use limited to: TAGORE ENGINEERING COLLEGE. Downloaded on June 4, 2009 at 03:52 from IEEE Xplore. Restrictions apply.
780 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 21, NO. 6, JUNE 2009
Fig. 9. Sample of test execution strategies. (a) Single Phase. (b) Active Revising. (c) Passive Revising. (d) Atypical.
Authorized licensed use limited to: TAGORE ENGINEERING COLLEGE. Downloaded on June 4, 2009 at 03:52 from IEEE Xplore. Restrictions apply.
COSTAGLIOLA ET AL.: MONITORING ONLINE TESTS THROUGH DATA VISUALIZATION 781
Fig. 10. Strategy comparison. (a) Strategy usage. (b) Avg. score. (c) Score tendency.
The remaining time is used for one or more revising by definition, the Active-Revising strategy prescribes the
phases. While in Active Revising, the learner has the skipping ð¼ less reasoningÞ of the questions whose answer is
“active” role of answering the previously skipped more uncertain for the learners. Another predictable result,
questions, in Passive Revising, s/he only checks the due to the above arguments, is that the Passive-Revising
previously answered questions. Also, in this case, strategy has less phases than the Active one on the average
we discovered the same general rule regarding (2.55 and 3.2, respectively).
subsequent phases: the first phase lasts longer, and
subsequent phases have decreasing durations. A 5.2 Detection of Correlation among Questions
sample of this strategy is shown in Fig. 9c. By visually inspecting the charts, we can also be assisted in
By visually analyzing the data of our experiment (Fig. 10), it the detection of correlated questions. In some cases, a visual
came out that the most frequently adopted strategy is Active pattern similar to a stalactite occurs in the chart, as shown
Revising, which was used by 40 learners out of 71 (56.5 per- in Fig. 11. The occurrence of such a pattern says that while
cent), followed by the Passive-Revising strategy (20 learners the learner is browsing the current question s/he could
out of 71, 28.2 percent) and by the Single-Phase one, used only infer the right answer by a previous question. In the
in 9 cases out of 71 (12.7 percent). Only two learners have example shown in the figure, while browsing question "2",
adopted an atypical strategy (see Fig. 9d), which cannot be the learner understood that s/he had given a wrong
classified in any of the previously described patterns. The response to question “1” and went back to change the
best results have been achieved by learners who adopted the response, from option 5 to option 1.
Passive-Revising strategy, with an average score of 17.6 exact
responses out of the 25 test questions. With Active Revising,
on the other hand, an average score of 16.4 has been
achieved. Last, the Single-Phase strategy turned out to be the
worst one, showing an average score of 15.1. Therefore, it
appears that a winning strategy is one of using more than
one phase, and this is confirmed by the slightly positive linear
correlation (0.14) observed between the number of phases
and the score achieved on the test. For both strategies using
more than one phase (active and passive revising), the score is
often improved through the execution of a new phase. The
improvement is evident in the early phases and tends to be
negligible in subsequent phases: starting from a value of 14.3
obtained after the first phase, the average score increases to
16.2 after the second phase. The presence of further phases
brings the final score to an average value of 16.5. The average
duration of the first phase in the Passive-Revising strategy
(14’50”) is longer than the one registered for the Active- Fig. 11. A Stalactite in the chart shows the correlation between two
Revising strategy (11’51”). This result was predictable, since questions.
Authorized licensed use limited to: TAGORE ENGINEERING COLLEGE. Downloaded on June 4, 2009 at 03:52 from IEEE Xplore. Restrictions apply.
782 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 21, NO. 6, JUNE 2009
Authorized licensed use limited to: TAGORE ENGINEERING COLLEGE. Downloaded on June 4, 2009 at 03:52 from IEEE Xplore. Restrictions apply.
COSTAGLIOLA ET AL.: MONITORING ONLINE TESTS THROUGH DATA VISUALIZATION 783
visualization of information regarding the activity in the [5] L. McClain, “Behavior During Examinations: A Comparison of
‘A,’ ‘C,’ and ‘F’ Students,” Teaching of Psychology, vol. 10, no. 2,
discussion forums is also a concern of the work presented by pp. 69-71, 1983.
May et al. [43]. Particular attention in the paper has been [6] D. Hand, H. Mannila, and P. Smyth, Principles of Data Mining,
devoted to the description of technological aspects such as Adaptive Computation and Machine Learning Series, A Bradford
the integration of the data tracked on the client side with Book, MIT Press, 2001.
[7] N. Ye, “Introduction,” The Handbook of Data Mining. Lawrence
those recorded on the server side and the possibility of Erlbaum Assoc., 2003.
instantiating the data logging framework on many e- [8] C. Plaisant and B. Shneiderman, “Show Me! Guidelines for
learning systems, as we have done in our work too. The Producing Recorded Demonstrations,” Proc. IEEE Symp. Visual
technique of tracking client-side events has also been Languages and Human-Centric Computing (VLHCC ’05), pp. 171-178,
2005.
employed by Hardy et al. [45], in reference to the paths [9] R. Mazza and V. Dimitrova, “Student Tracking and Personaliza-
followed by the learners in the browsing of learning tion: Visualising Student Tracking Data to Support Instructors in
material. Typical data gathered in this activity are the Web-Based Distance Education,” Proc. 13th Int’l World Wide Web
frequency and the duration of the accesses to online Conf. Alternate Track Papers and Posters, pp. 154-161, 2004.
[10] R.S. Baker, A.T. Corbett, K.R. Koedinger, and A.Z. Wagner, “Off-
material. Another activity performed by the authors is the Task Behavior in the Cognitive Tutor Classroom: When Students
analysis of the correlation between the behavioral data and ‘Game the System’,” Proc. ACM SIGCHI Conf. Human Factors in
the results achieved in the exams. The same thing has been Computing Systems (CHI ’04), pp. 383-390, 2004.
done in our work. Nevertheless, while the learner’s behavior [11] Asynchronous JavaScript Technology and XML (Ajax) with the Java
Platform, http://java.sun.com/developer/technicalArticles/
during test execution seems to be correlated to the final score J2EE/AJAX/, 2007.
of the tests, the authors have not found any interesting result [12] G. Costagliola, F. Ferrucci, V. Fuccella, and F. Gioviale, “A Web
about the correlation of the data on the browsing of Based Tool for Assessment and Self-Assessment,” Proc. Second
courseware material with the score achieved on the final Int’l Conf. Information Technology: Research and Education (ITRE ’04),
pp. 131-135, 2004.
exam of the course. [13] U. Demsar, “Data Mining of Geospatial Data: Combining Visual
and Automatic Methods,” PhD dissertation, Dept. of Urban
Planning and Environment, School of Architecture and the Built
7 CONCLUSION Environment, Royal Inst. of Technology (KTH), 2006.
[14] U. Fayyad and G. Grinstein, “Introduction,” Information Visualisa-
We have presented an approach and a system to let tutors tion in Data Mining and Knowledge Discovery, Morgan Kaufmann,
monitor learners’ strategies during online tests. The approach 2002.
exploits data visualization to draw the data characterizing the [15] D.A. Keim and M. Ward, “Visualization,” Intelligent Data Analysis,
M. Berthold and D.J. Hand, eds., second ed. Springer, 2003.
learner’s test strategy, in order to trigger the tutor’s attention
[16] D.A. Keim, Visual Exploration of Large Data Sets, second ed.
and to let him/her discover previously unknown behavioral Springer, 2003.
patterns of the learners and conceptual relationships among [17] G. Grinstein and M. Ward, “Introduction to Data Visualization,”
test items. In this way, the tutor is provided with a powerful Information Visualisation in Data Mining and Knowledge Discovery,
Morgan Kaufmann, 2002.
tool that lets him/her review the whole assessment process [18] P.E. Hoffman and G.G. Grinstein, “A Survey of Visualizations for
and evaluate possible improvements. High-Dimensional Data Mining,” Information Visualization in Data
We have extensively used the implemented system Mining and Knowledge Discovery, pp. 47-82, 2002.
[19] M. Ankerst, “Visual Data Mining,” PhD dissertation, Ludwig
experimentally to evaluate online test strategies in the
Maximilians Universitat, Munchen, Germany, 2000.
courses of our faculty, in order to assess the whole approach. [20] D.A. Keim, W. Müller, and H. Schumann, “Visual Data Mining,”
This lets us discover several relevant patterns regarding the STAR Proc. Eurographics ’02, D. Fellner and R. Scopigno, eds.,
test quality, the characteristics of used strategies, and the Sept. 2002.
[21] D.A. Keim, “Information Visualization and Visual Data Mining,”
impact on the final score. IEEE Trans. Visualization and Computer Graphics, vol. 8, no. 1,
In the future, we would like to devise new visual pp. 1-8, Jan.-Mar. 2002.
representations and perform further experiments, possibly [22] D.A. Keim, C. Panse, M. Sips, and S.C. North, “Pixel Based Visual
Data Mining of Geo-Spatial Data,” Computers & Graphics, vol. 28,
in combination with classical data mining algorithms. no. 3, pp. 327-344, 2004.
Moreover, since the approach lends itself to the application [23] K. Cox, S. Eick, and G. Wills, “Brief Application Description—
to other application fields such as e-commerce, in the Visual Data Mining: Recognizing Telephone Calling Fraud,”
Data Mining and Knowledge Discovery, vol. 1, pp. 225-231, 1997.
future, we would like to evaluate its use in these contexts.
[24] A. Inselberg, “Visualization and Data Mining of High-Dimensional
Finally, we wish to explore more deeply the problem of Data,” Chemometrics and Intelligent Laboratory Systems, vol. 60,
cheating detection, which we just treat in broad terms here, pp. 147-159, 2002.
since we believe that an approach based on logging and [25] L. Chittaro, C. Combi, and G. Trapasso, “Data Mining on
Temporal Data: A Visual Approach and Its Clinical Application
visualization can be promising and effective to this aim. to Hemodialysis,” J. Visual Languages and Computing, vol. 14,
pp. 591-620, 2003.
[26] H. Miller and J. Han, “An Overview,” Geographic Data Mining and
REFERENCES Knowledge Discovery, pp. 3-32, Taylor and Francis, 2001.
[1] J. Bath, “Answer-Changing Behaviour on Objective Examina- [27] I. Kopanakis and B. Theodoulidis, “Visual Data Mining Modeling
tions,” J. Educational Research, no. 61, pp. 105-107, 1967. Techniques for the Visualization of Mining Outcomes,” J. Visual
[2] J.B. Best, “Item Difficulty and Answer Changing,” Teaching of Languages and Computing, no. 14, pp. 543-589, 2003.
Psychology, vol. 6, no. 4, pp. 228-240, 1979. [28] M. Kreuseler and H. Schumann, “A Flexible Approach for Visual
[3] J. Johnston, “Exam Taking Speed and Grades,” Teaching of Data Mining,” IEEE Trans. Visualization and Computer Graphics,
Psychology, no. 4, pp. 148-149, 1977. vol. 8, no. 1, pp. 39-51, Jan.-Mar. 2002.
[4] C.A. Paul and J.S. Rosenkoetter, “The Relationship between the [29] G. Manco, C. Pizzuti, and D. Talia, “Eureka!: An Interactive and
Time Taken to Complete an Examination and the Test Score Visual Knowledge Discovery Tool,” J. Visual Languages and
Received,” Teaching of Psychology, no. 7, pp. 108-109, 1980. Computing, vol. 15, pp. 1-35, 2004.
Authorized licensed use limited to: TAGORE ENGINEERING COLLEGE. Downloaded on June 4, 2009 at 03:52 from IEEE Xplore. Restrictions apply.
784 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 21, NO. 6, JUNE 2009
[30] S. Kimani, S. Lodi, T. Catarci, G. Santucci, and C. Sartori, Gennaro Costagliola received the laurea de-
“Vidamine: A Visual Data Mining Environment,” J. Visual gree in computer science from the Università
Languages and Computing, vol. 15, pp. 37-67, 2004. degli Studi di Salerno, Fisciano, Italy, in 1987.
[31] I. Kopanakis, N. Pelekis, H. Karanikas, and T. Mavroudkis, Visual Following four years postgraduate studies,
Techniques for the Interpretation of Data Mining Outcomes, pp. 25-35. starting in 1999, at the University of Pittsburgh,
Springer, 2005. he joined the Faculty of Science, Università degli
[32] P. Buono and M. Costabile, “Visualizing Association Rules in a Studi di Salerno, where in 2001, he reached the
Framework for Visual Data Mining,” From Integrated Publication position of professor of computer science in the
and Information Systems to Virtual Information and Knowledge Dipartimento di Matematica ed Informatica. His
Environments, Essays Dedicated to Erich J. Neuhold on the Occasion research interests include Web systems and
of His 65th Birthday, pp. 221-231, Springer, 2005. e-learning and the theory, implementation, and applications of visual
[33] U. Fayyad, G. Piatetsky-Shapiro, and P. Smyth, “From Data languages. He is a member of the IEEE, the IEEE Computer Society,
Mining to Knowledge Discovery in Databases,” AI Magazine, and the ACM.
pp. 37-54, 1996.
[34] M.C. Chen, J.R. Anderson, and M.H. Sohn, “What Can a Mouse Vittorio Fuccella received the laurea (cum
Cursor Tell Us More?: Correlation of Eye/Mouse Movements on laude) and PhD degrees in computer science
Web Browsing,” Proc. CHI ’01 Extended Abstracts on Human Factors from the University of Salerno, Fisciano, Italy, in
in Computing Systems, pp. 281-282, 2001. 2003 and 2007, respectively. He is currently a
[35] Hibernate, Hibernate Framework, http://www.hibernate.org, 2007. research fellow in computer science in the
[36] ECMAScript, Ecmascript Language Specification, http://www.ecma- Dipartimento di Matematica ed Informatica,
international.org/publications/files/ECMA-ST/Ecma-262.pdf, Università degli Studi di Salerno. He won the
2008. Best Paper Award at ICWL ’07. His research
[37] Xquery 1.0: An XML Query Language, World Wide Web Consortium interests include Web engineering, Web tech-
(W3C) Recommendation, http://www.w3.org/TR/XQuery/, nologies, data mining, and e-learning.
Jan. 2007.
[38] Xquery API for JAVATM (XQJ), http://jcp.org/en/jsr/
detail?id=225, Nov. 2007. Massimiliano Giordano received the laurea
[39] M. Dick, J. Sheard, C. Bareiss, J. Carter, D. Joyce, T. Harding, and degree in computer science from the Università
C. Laxer, “Addressing Student Cheating: Definitions and Solu- degli Studi di Salerno, Fisciano, Italy, in 2003.
tions,” SIGCSE Bull., vol. 35, no. 2, pp. 172-184, 2003. Currently, he is a PhD student in computer
[40] T.S. Harding, D.D. Carpenter, S.M. Montgomery, and N. Steneck, science in the Dipartimento di Matematica ed
“The Current State of Research on Academic Dishonesty among Informatica, Università degli Studi di Salerno.
Engineering Students,” Proc. 31st Ann. Frontiers in Education Conf. His research interests include access control,
(FIE ’01), vol. 3, pp. 13-18, 2001. data mining, data visualization, and e-learning.
[41] S. Mulvenon, R.C. Turner, and S. Thomas, “Techniques for
Detection of Cheating on Standardized Tests Using SAS,” Proc.
26th Ann. SAS Users Group Int’l Conf. (SUGI ’01), pp. 1-6, 2001.
[42] H. Shao, H. Zhao, and G.-R. Chang, “Applying Data Mining to
Detect Fraud Behavior in Customs Declaration,” Proc. Int’l Conf. Giuseppe Polese received the laurea degree in
Machine Learning and Cybernetics (ICMLC ’02), vol. 3, pp. 1241-1244, computer science and the PhD degree in
2002. computer science and applied mathematics from
[43] M. May, S. George, and P. Prévôt, “Tracking, Analyzing and the Università degli Studi di Salerno, Fisciano,
Visualizing Learners’ Activities on Discussion Forums,” Proc. Italy, and the master’s degree in computer
Sixth IASTED Int’l Conf. Web Based Education (WBE ’07), vol. 2, science from the University of Pittsburgh. He is
pp. 649-656, 2007. an associate professor in the Dipartimento di
[44] T. Mochizuki, H. Kato, K. Yaegashi, T. Nagata, T. Nishimori, Matematica ed Informatica, Università degli
S. Hisamatsu, S. Fujitani, J. Nakahara, and M. Suzuki, “Promo- Studi di Salerno. His research interests include
tion of Self-Assessment for Learners in Online Discussion Using visual languages, databases, e-learning, and
the Visualization Software,” Proc. Conf. Computer Support for multimedia software engineering.
Collaborative Learning (CSCL ’05), pp. 440-449, 2005.
[45] J. Hardy, S. Bates, J. Hill, and M. Antonioletti, “Tracking and
Visualization of Student Use of Online Learning Materials in a . For more information on this or any other computing topic,
Large Undergraduate Course,” Proc. Sixth Int’l Conf. Web-Based please visit our Digital Library at www.computer.org/publications/dlib.
Learning (ICWL ’07), pp. 280-287, 2007.
[46] M. Sasakura and S. Yamasaki, “A Framework for Adaptive
E-Learning Systems in Higher Education with Information
Visualization,” Proc. 11th Int’l Conf. Information Visualization
(IV ’07), pp. 819-824, 2007.
[47] G.K.L. Tam, R.W.H. Lau, and J. Zhao, “A 3D Geometry Search
Engine in Support of Learning,” Proc. Sixth Int’l Conf. Web-Based
Learning (ICWL ’07), pp. 248-255, 2007.
[48] Q.V. Nguyen, M.L. Huang, and I. Hawryszkiewycz, “A New
Visualization Approach for Supporting Knowledge Management
and Collaboration in e-Learning,” Proc. Eighth Int’l Conf. Informa-
tion Visualisation (IV ’04), pp. 693-700, 2004.
[49] C.G. da Silva and H. da Rocha, “Learning Management Systems’
Database Exploration by Means of Information Visualization-
Based Query Tools,” Proc. Seventh IEEE Int’l Conf. Advanced
Learning Technologies (ICALT ’07), pp. 543-545, 2007.
Authorized licensed use limited to: TAGORE ENGINEERING COLLEGE. Downloaded on June 4, 2009 at 03:52 from IEEE Xplore. Restrictions apply.