Towards a Base Architecture for Spoken Language Transcript{s,tion}
Henry S. Thompson
Language Technology Group
HCRC, University of Edinburgh

26 September 1997

1. Acknowledgements

This work was carried out at the Human Communication Research Centre, whose baseline funding comes from the UK Economic and Social Research Council, and at the Institute for Research on Cognitive Science, University of Pennsylvania, funded by NSF and (D)ARPA.

2. Background
3. Background, cont'd
4. Evolutionary Pressure
5. Opportunity
6. Diversion: XML
7. XML: Designed for Use
8. LT XML: Free tools for XML applications
9. LT XML: Basic Architecture
10. What's "Stand-off" Annotation?
11. Hyperlink Semantics for stand-off
12. An Aside about Architecture
13. A Simple Example

Base:
 <w id=w12>Now</w>
 <w id=w13>is</w>
 <w id=w14>the</w>
. . .
 <w id=w28>party</w>
 <c id=c4>.</c>

Standoff:
 <s xml-type='link' show='include'
      href='&f;#id(w12)..id(c4)'>
 </s>
14. Simple example, cont'd

<s>
 <w id=w12>Now</w>
 <w id=w13>is</w>
 <w id=w14>the</w>
. . .
 <w id=w28>party</w>
 <c id=c4>.</c>
</s>
<s>
 . . .
</s>
15. Inclusion
16. Simple Tools are Simple to Build
17. Pre-constructed Tools
18. Availability
19. End Diversion: The Bottom Line
20. The Base Level
21. Annotation

Base:
 <tok id='t3' start='1.3'>Saint</tok>
 <tok id='t4' start='1.7'>Joan</tok>
 <tok id='t5' type='clitic'>+'s</tok>
Standoff:
 <w id='w1' href='&f;#id(t3)..id(t4)' tag='pnn'/>
 <w id='w2' href='&f;#id(t5)' tag='bez'>
22. Conclusion