HST weekly schedule and travel plans
Accelerated Natural Language Processing (ANLP), Winter Term 2016
Foundations of Natural Language Processing (FNLP), Spring Term 2017
Contacting me (including PGP key)
Index of all documents on this site of potential general interest
I'm based in the Institute for Language, Cognition and Computation of the School of Informatics at the University of Edinburgh, with the title "Professor of Web Informatics". I'm interested in the Architecture of the Web, Markup Languages, the Foundations of Cognitive Science, as well as Computational Linguistics, Data-Intensive Linguistics, Language Corpora and Corpus Management.
Since February 2005 many of my activities have focussed on my membership in the W3C Technical Architecture Group and my work within W3C working groups:
If you're interested in pursuing an MSc or PhD in one of these areas, please see the Informatics postgraduate prospectus and/or get in touch with me.
Outside my University time I do consulting and business mentoring in the area of XML, XML Pipelines and XML Schema via Markup Systems.
Version 1.2 of LT XML, a fully compliant XML tool kit and API for WIN32 and UN*X platforms, is available.
The beta of XED, my XML document instance editor is still available.
The current version of XSV, an XML Schema validator, is available via a web interface.
An add-on to Python's SAX functionality providing a simple 'pull'-style interface PullFromSAX.py
The beta version of xslj, an old (not-standard) XSL to DSSSL translator is still available.
I helped launch the use of standoff markup to improve annotation management in complex datasets: the underlying techonology is described in my SGML Europe '97 paper. My presentation to the COCOSDA meeting in Rhodes discusses the application of this technology to spoken language transcripts, available as Powerpoint v.7 version, Powerpoint v.4 version and quick and dirty HTML from Powerpoint outline.
This section is of historical interest only at this point -- I haven't worked on this stuff for years.
DSC version 2.0, an online syntax checker, normaliser and implementation framework for DSSSL, based on embedding a full R4RS Scheme interpreter in James Clark's SP parser, is available for downloading. For more information, see the release announcement, which describes dsc in more detail.
Version 2.0, as demonstrated at SGML/XML '97 in November 1997, provides a much richer implementation framework than previous versions, including the full query language and the transformation language.
DSSSL users might find my index to DSSSL procedures by prototype useful. I've also produced a summary of information about the copyright status of the DSSSL standard and pointers to various electronic versions thereof.
For DSSSL/SGML implementation mavens, heres an illustrated example of an SGML source grove.