EVALUATING SOFTWARE VISUALIZATION USE FOR COMPUTER SCIENCE EDUCATION

CS7390 WINTER '98

ASHLEY TAYLOR
















INDEX
















EVALUATION METHODS

Empirical/Experimental

Observational/Naturalistic


Prev Index Next














Empirically Assessing Algorithm Animations as Learning Aids
Stasko and Lawrence

Study 1

Stasko, et.al. (1993) compared use by grad students of

Post-test showed non-significant advantage for animation group.
Animation group finished post-test slightly faster.

DESIGN FLAW?

"Most of the test items require the ability to accurately carry out the main procedures of the algorithm, and neither presentation seemed likely to give participants that ability."
Prev Index Next














Study 2

Lawrence and Stasko (1994) compared several usage modes for undergrads with Prim's minimum spanning tree alg.


Prev Index Next














Issues Raised In Other Studies


Prev Index Next














A Principled Approach to the Evaluation of SV: A Case Study in Prolog
Paul Mulholland

Combines Empirical and Observational Approaches

"Much of the empirical work ... suffers from the problem of trying to find global generalisations that are not there."

"Many of the studies derive performance results without deriving the information necessary to explain them. Observations are usually not made of how the subjects approached the task and what features they found confusing."

Prev Index Next














STUDY

Compare four Prolog SV's using empirical + observational + cognitive approach.

Test task - given program and source code, identify differences with modified program (output only, no source code)

Program Differences:


Prev Index Next














METHOD

8 pairs of students used each of four SV's:

Spy Trace execution using Unification model
PTP (Prolog Trace Package )

Give more execution details than Spy

TPM Transparent Prolog Machine Show execution graphically using depth first AND/OR trees Provide overview, detailed views
TTT Textual Tree Tracer Trace using close format to source code



Common Execution Environment (PPVL) used for all SV's:


Prev Index Next














RESULTS
Main Types of Misunderstanding:

Total Instances of Misunderstanding:
Least PTP
SPY/TTT
Most TPM

However, TPM best for Data Flow.

Prev Index Next















Number of Problems Solved in Given Time:

Most PTP
TTT
Spy
Least TPM

SOME CONCLUSIONS


Prev Index Next














Testing Effectiveness of Algorithm Animation
Gurka and Citrin

Discusses Issues related to evaluation of Algorithm Animation and Software Visualization:

"...it is possible that animations, as they are now created and used, are not particularly effective for teaching algorithms. "

"Since intuition suggests otherwise, it is important to continue research until definitive results are obtained.... if we can show ineffectiveness, perhaps the same research methods can suggest new modes for using animation that will succeed."

Propose that animations can be used for:


Prev Index Next














Observational Studies

Empirical Studies


Prev Index Next














Animation factors for design and evaluation


1. Usability difficulties.
2. Animation quality, discussed below.
3. System training, including tutorials on exploration techniques.
4. System availability.
5. Type of animation use, i.e. lecture, lab, self-study.
6. Individual differences among learners, particularly exploratory and visual learning ability.
7. Algorithm complexity. Some require multi-level presentation?

Prev Index Next














Each algorithm should be examined to see

Design Principles from Art/Cinema should be used.
Multimedia design study [Faraday and Sutcliffe 97] supports this.

Prev Index Next














RECENT WORK AT TECH.

Grad Students try to learn Complex Data Structure (Binomial Heap)

Materials
Text + Pseudocode + Anim
OR
Text + Pseudocode + Static Figures

PRELIMINARY RESULTS


Prev Index Next














Student Animation Construction

(Stasko, Newstetter, Taylor)

Prev Index