Ontological Excavation:
Microsoft PowerPoint 2000

Home Page

Academic Information
Research Abstract
Publications
Ontological Excavation
Curriculum Vitae
Past Projects
Other

Personal Information
My First Name
Hobbies

Creative Efforts
Biographical
Cosmic Irony
Essays
Movie Reviews
Photography
Random Interest

Literature Excerpts
Essays and Anecdotes
Favorite Poems
Folk Tales and Myths
Historical Writings
Oriental Philosophy
Stories and Fragments

Links


Web Counter

Introduction

Microsoft PowerPoint 2000 is primarily a tool for building and delivering presentations. While it does possess mechanisms that can be exploited to other ends (Bob Balzer gives a demo showing how he turned PowerPoint into a software development environment by using .COM listeners (Balzer, "Living with COTS", ICSE 2002), I  hypothesized that PowerPoint will reveal an ontology that contains a primary cluster of core concepts related to presentations and slides and supporting concepts such as graphics and animations as smaller ontological clusters connected to it – characteristic of a Reef ontological structure. As such, PowerPoint should evidence high conceptual integrity both in the structural analysis of its ontology. This page contains the following:

Excavation Artifacts

These are artifacts produced by Ontological Excavation. The XML versions were generated in Visio 2003 Professional and is probably optimized for Microsoft Internet Explorer. The morphology diagram for PowerPoint will probably not be readable even if you are on a Windows machine. However, if you are able to run the active scripts to get the search bar, you should be able to type in some key word - a label that can be found in the PowerPoint User Interface, for example - and the search will return all the objects in the graph that have that keyword. The abbreviations in the morphology refer to interface elements like buttons (B), Windows (W), Tabbed Panes (TP), List Items (LI), Radio Buttons (RB), Text Fields (TF), and so on. 

Core Concepts of PowerPoint 2000

The table below lists the core concepts of PowerPoint 2000 identified using ontological analysis techniques in order of structural importance::

Core Concept Name

Description

Centrality Value

AutoShape [Draw] Object This is any AutoShape object that is not a line or a connector - this includes Text Boxes. 23.9

Presentation

Presentation is the work product that users create and edit in PowerPoint 21.4
Slide Object A slide object is any object that can be inserted into a slide 17.1
Slide A slide is a frame in a presentation. 16.4
PowerPoint File A PowerPoint file is any file that PowerPoint recognizes 15.4
Slide Show A slide show is how a Presentation is typically viewed by an audience. 15.3
Color Color - everything has color in PowerPoint 15.0
Send To [Destination] A PowerPoint file can be "sent" to an email recipient. 10.8
WordArt This is a graphic that takes text and displays it with special formatting. 10.0
Genigraphics Wizard This is a wizard that provides commercial consulting and packaging services for PowerPoint presentations on the user's request 9.6
Text Text 9.5
Line Anything that is a line in PowerPoint - usually a border for a draw object 9.1
Selection A highlighted set of text or objects or slides in PowerPoint - to be cut, copied, or pasted. 9.0
Online Broadcast [Tool] Allows the user to set up a presentation through a web page for other users to log in and view during the presentation. 8.9
Animation A setting that can be possessed by a slide object or text that animates it during a slide show.  8.2
File A general system file 8.1
Notes Page Object Any object that can be placed on the notes page which can be printed out as a handout. 7.8
[Configuration] An inferred concept encapsulating all options such as Edit and Print options. 7.5

Conceptual Integrity Measures

I have been exploring metrics for measuring the conceptual integrity of computing applications. Thus far, I have identified two possible measures based on graph theory: conceptual coherence and conceptual complexity. I am also testing some combined calculations for the overall conceptual integrity. 

Conceptual Coherence - Conceptual coherence is a measure of an application's interrelatedness of its concepts, and uses average distance between nodes in a graph. The theory (explained in detail here) is that if a semantic network reflects potential data dependencies then a complete connected network contains concepts that are all interrelated and have an average distance of 1.0. The less related the concepts, the greater the average distance. Thus, the hypothesis is that removing those concepts essential to the application's domain model would make the resulting ontology less coherent, appearing as an increase in average distance. Conversely, removing peripheral concepts, not essential to the domain model, would make the resulting ontology more coherent, producing a decrease in average distance. Thus, conceptual coherence values reflect an ontology's "incoherence" where the higher the value, the more incoherent the ontology. 

Conceptual Complexity - An application's conceptual complexity reflects the average number of relationships per node (including attributes which are modeled as nodes in the ontology), and uses the average degree across all nodes in a graph (where a degree is simply the number of edges on a node). The theory (explained in detail here) is that a concept in a semantic network possessing many edges connecting it to its attributes or to other nodes has a high complexity versus a node with few edges. Thus, a complex concept is more likely to have interactions with many other concepts, raising the overall complexity of the ontology.  For conceptual complexity, the hypothesis is that removing those nodes that help to simplify the ontology by organizing concepts will increase the average degree. Removing inherently complex concepts decreases the average degree of the ontology. 

Below are the results of a systematic study where the conceptual coherence and complexity was measured for the ontology ("Original"), then the core concepts were systematically removed, along with any isolated nodes and components that they produced when removed from the graph) and conceptual coherence and complexity were measured again. 

Concept Removed From Ontology Conceptual Incoherence  (average distance) Concept Removed From Ontology Conceptual Complexity (average degree)
PowerPoint File 7.17 Palm Pilot Scheduler 2.83
Slide Show 7.04 Notepad 2.46
AutoShape Draw Object 7.03 Genigraphics Wizard 2.46
Color 7.02 Online Broadcast Tool 2.45
Slide 7.02 AutoShape Draw Object 2.45
Presentation & Slide 7.01 Animation 2.44
Text 6.99 [Configuration] 2.44
Slide Object 6.97 Word Art 2.44
File 6.97 Presentation & Slide 2.44
Line 6.92 Slide Show 2.44
[Configuration] 6.92 Line 2.44
Notes Page Object 6.92 Original 2.44
Selection 6.91 File 2.44
Word Art 6.91 Presentation 2.43
Original 6.89 PowerPoint File 2.43
Animation 6.88 Selection 2.43
Presentation 6.87 Color 2.42
Genigraphics Wizard 6.58 Notes Page Object 2.42
Online Broadcast Tool 6.54 Slide 2.41
Notepad 4.61 Text 2.41
Palm Pilot Scheduler 3.36 Slide Object 2.41
Calculator / Calendar 3.06 CD Player 2.19
CD Player 2.82 Calculator / Calendar 2.00

Conceptual Integrity Metric - I am testing two calculations of conceptual coherence and complexity to provide an approximation of overall conceptual integrity. Currently they are labeled HZ1 and HZ2 (HZ stands for the Hsi-Zook measure). HZ1 is simply the product of coherence and complexity. HZ2 is the sum of the squares of coherence and complexity. These results are here mainly for completeness as these structural metrics will only be found to be meaningful with more data points and will probably have to be normalized against the size of the ontology.

Concept Removed From Ontology

Hsi-Zook 1 (Coherence * Complexity)

Concept Removed From Ontology

Hsi-Zook 2 (Coherence2 + Complexity2
PowerPoint File 17.38 PowerPoint File 57.23
AutoShape Draw Object 17.23 Slide Show 55.51
Slide Show 17.15 AutoShape Draw Object 55.44
Presentation & Slide 17.10 Color 55.19
Color 17.01 Presentation & Slide 55.09
Slide 16.93 Slide 55.05
File 16.93 Text 54.71
[Configuration] 16.90 File 54.49
Text 16.87 Slide Object 54.46
Word Art 16.86 Line 53.87
Line 16.86 [Configuration] 53.85
Slide Object 16.83 Word Art 53.69
Animation 16.81 Notes Page Object 53.66
Selection 16.76 Selection 53.63
Original 16.75 Animation 53.28
Notes Page Object 16.71 Original 53.27
Presentation 16.68 Presentation 53.08
Genigraphics Wizard 16.15 Genigraphics Wizard 49.28
Online Broadcast Tool 16.04 Online Broadcast Tool 48.79
Notepad 11.35 Notepad 27.30
Palm Pilot Scheduler 9.51 Palm Pilot Scheduler 19.30
CD Player 6.18 Calendar / Calculator 13.36
Calendar / Calculator 6.12 CD Player 12.77

A curious result from this analysis is evidence suggesting that removing Presentation improves both the coherence and complexity of the ontology. Intuitively, I believe that if the concept of Presentations were removed from the PowerPoint problem domain, a user could still create slides, handouts, and display slide shows but it wouldn't be called a presentation. It would be something else. Plus, since PowerPoint 2000 has the capability of saving presentations in other formats, the concept of a presentation might just serve as a general organizing concept. Not curious are the metrics that show that removing the Genigraphics Wizard and Online Broadcast Tools reduce the incoherence of PowerPoint.

Use Case Silhouette

A use cases are from the Unified Software Process. They are essentially scenarios modeled as a series of actions that systems perform to achieve a goal for a user. A use case silhouette highlights those concepts activated by elements in the morphology. For PowerPoint 2000, we used Microsoft PowerPoint 2000 for Dummies for Windows by Doug Lowe.

PowerPoint 2000 Use Case Silhouette Statistics

Source

PowerPoint 2000 for Windows for Dummies

# of use cases:

199

# concepts invoked:

499

Total # concepts

1686

Ontological coverage:

30 %

We have hypothesized that a set of use cases reflecting average usage of an application should invoke a concepts in the ontology with a frequency that should parallel the structural importance of this concept within the ontology.  In other words, core concepts identified from the ontology should also be the most important concepts in a typical set of use cases. We determined frequency by counting the number of times a concept appeared in a use case. The table below lists the most frequently accessed concepts in the use cases along with their centrality values and whether they have membership in the set of core concepts.

Partial List of Concepts Ordered by Times Referenced in Use Cases

Concept Name

Frequency

Centrality Value

Core?

Slide

48

16.4

Y

Presentation

43

21.4

Y

Text

34

9.5

Y

Selection

34

9.0

Y

Color

23

15.0

Y

AutoShape [Draw] Object

21

23.7

Y

Current Slide

20

2.4

 

Slide View

20

1.2

 

Slide Master

18

0.3

 

Font [Format]

16

3.4

 

File

13

8.1

Y

Position (of Slide Object)

12

0.2

 

Outline View

12

0.4

 

Text box

11

3.8

 

Fill

11

4.4

 

Active Presentation

11

6.0

 

Slide Sorter View

11

0.2

 

Color Scheme

10

0.9

 

Slide Show

10

15.3

Y

Copy From (Clipboard)

10

0.0

 

Sound

9

5.2

 

Paste From (Clipboard)

9

0.0

 

Outline

8

3.8

 

Title Text

8

1.8

 

Notes Master

8

0.1

 

Paragraph

8

1.5

 

Bullet

8

0.9

 

Line

8

9.1

Y

Picture [Clip] (Clip Art)

8

0.1

 

Normal View

8

0.1

 

Many of the concepts on the list do not appear as core concepts in the ontology. Some of this can be attributed to how the use cases were written. For example, Current Slide appears frequently because a use case will start “Move to the slide” – which refers to the current slide as opposed to a more general reference to slides. In the case of the Views, many use cases would have an instruction “Switch to the Slide View”.  Clipboard functions – copying and pasted – were also referenced often throughout the cases as methods for placing certain objects.

The use case silhouette does show that the Slide Master has much more importance in PowerPoint usage than the structural metrics suggest. The author referenced using slide masters for many things, including creating backgrounds, templates, changing footers, and so on. We first checked to see if we had made a modeling error in our ontology. The figure below shows the Slide Master ontology.

While we stand by our use of an association to show the relationship of the Slide Master to Slide – that the Slide Master formats all Slides in a Presentation – this example shows a potential weakness of our representation. The Slide Master formatting affects all the slides in a presentation and thus, it would follow that a user would take extra care to ensure that the Master was designed correctly. On the other hand, this example demonstrates how use case silhouetting can reveal potential areas of concern for developers that would not have emerged in an abstract requirements analysis or testing methodology.

At least three of the core concepts can be found on both lists: Presentation, Slide, and AutoShape [Draw] Object. Text, Selection, and Color are also core concepts and Slide Show does appear as an important concept in the Use Cases, as do Line and File. While this partially proves our hypothesis, there are a number of concepts that do not appear as frequently in the use cases as their centrality values would suggest. We have listed these below

Core Concepts Absent from Use Case Silhouettes

Concepts Missing from

Use Case Silhouette

Centrality Value

Slide Object

17.1

PowerPoint File

15.4

Send To [Destination]

10.9

WordArt

10.0

Genigraphics Wizard

9.6

Online Broadcast [Tool]

8.9

Broadcasts

8.6

Send To

8.3

Animation

8.2

Notes Page Object

7.8

[Configuration]

7.5

Slide Object, PowerPoint File, Notes Page Object, Animation, and [Configuration] are partially inferred concepts that we structured to model certain important generalizations. Because they were constructed somewhat indirectly from the PowerPoint application for modeling convenience, we would expect most use cases not to refer to them explicitly. However, AutoShape objects, Text Boxes, and ClipArt are considered Slide and Notes Page Objects. Slide Objects have animations and action settings. Animation ties together the concepts of Slide Transition, Entry Animation, and Preset Animation. Presentations and Slide Shows are also PowerPoint Files. The [Configuration] models the possible global options that can be set in the PowerPoint Tools Menu. If we accounted for the complete generalization and aggregation in the counts of those particular concepts, they would appear with more frequency across all the use cases.

The other missing concepts seem to confirm what we had proposed earlier – that the features whose removal from PowerPoint 2000 improved its conceptual coherence are peripheral to the application, despite their centrality values. Online Broadcast, the Genigraphics Wizard, and WordArt all have sufficiently large ontologies that they appear as core concepts in the ontology. Now, a use case silhouette consisting of a set of ‘likely’ uses of the application show that these are not very important to users. One use case was devoted to each topic, on average. It is likely that if we were to perform the same analysis using actual user data that these concepts may disappear entirely from the analysis.

Visualizations

To supplement the analysis and make the excavation artifacts more tractable for viewing , I created visualizations of the large graphs representing the morphology and ontology of the application. Visualizations were generated by UCINET (Borgatti, S.P., Everett, M.G. and Freeman, L.C. 2002. UCINET for Windows: Software for Social Network Analysis. Harvard: Analytic Technologies). The morphology and ontology graphs were saved as DL-format files (essentially adjacency list representations for the graphs they represent) which were then imported into NetDraw (Borgatti, S.P. 2002. Netdraw. Harvard: Analytic Technologies). After applying a spring embedding algorithm, which sets the length of an edge between two nodes as a function of the other nodes in the neighborhood, the diagram is saved as a KineMAGE. MAGE is a program for visualizing organic molecules and protein configurations developed by David Richardson through the Biochemistry Department at Duke University, NC.  I added additional colorings of the kinemages to highlight salient features in the diagrams.

  • PowerPoint 2000 Morphology Closeness centrality - The 'starburst' shape is characteristic of typical desktop applications that organize their functionality around menu bars. The color intensity reflects the average distance a node is from all other nodes. Hotter (red colors) indicate nodes that are easily accessible from the user interface. Colder colors are more distant or 'deeper' in the interface.
  • PowerPoint 2000 Morphology - colored by Menus or by functionality related to Menus
    • Red - Insert Menu Items
    • Orange - Format Menu Items
    • Yellow - Slide Show Items
    • Green - File Menu Items
    • Blue - Tool Menu Items
    • Hot Pink - Main Window
    • Purple - Edit Menu
    • Pink - View Menu
  • PowerPoint 2000 Ontology - This visualization seems to make a clear case for the conceptual coherence of PowerPoint 2000 - a large cluster of concepts with some protruding collections of concepts.
  • PowerPoint 2000 Peripheral Elements Colored - I colored the protruding elements in the visualization of the ontology by identifying their group. They turn out to be (at least from the angle shown) - the Genigraphics Wizard (red), the Online Broadcast Tool (gold), and the Print Settings (sea green). Thinking of applications and feature bloat - as a gardening metaphor - these features look like they could be 'trimmed' from the rest of the ontology without disturbing the integrity of the application's core features.
  • PowerPoint Graphics Functions - Ontology. Displays all the concepts directly related to graphics.
  • PowerPoint Centrality Measures. I colored and enlarged the more central nodes in order of their centrality with respect to their position in the graph. The visualization does seem to suggest that PowerPoint has some correspondence with the hypothetical Reef Ontological Structure
  • Use Case Ontological Coverage of PowerPoint 2000 for Windows for Dummies - Showing the ontological coverage of the use cases. Green nodes are concepts activated by the use cases. The use cases cover 30% of the ontology.
  • Use Case Silhouette of PowerPoint 2000 for Windows for Dummies contrasted with PowerPoint Centrality Measures. I intensity coded the Use Case Silhouette by size and color relative to a nodes frequency of reference in the use cases. Note that the central concepts are roughly the same concepts frequently accessed in both diagrams (red and orange). Also note how some of the outer branches - notably the Genigraphics Wizard and the Online Broadcast Tool have dropped out in the Use Case Silhouette.
  • Use Case Imaging - A Use Case Image is the analogue to a medical imaging technique such as Magnetic Resonance Imaging or Computer-Aided Tomography. It basically highlights those portions of the ontology that are of interest relative to a specific task. One of our future projects is to be able to take a user log and animate a UCI to show which parts of the ontology are activated during a task.