Research interests

My main research interests concern the notions of document structure and genre, and the somewhat broader notion of document architecture. This implies a view of textual artefacts as objects with an internal structure related to the social and technological settings of document production, as well as to the discursively constituted characteristics of document use.

These interests have grown out of a long-term study of markup languages and of the flexibility and usability of electronic documents, phenomena that have been part of almost every course in which I have been involved since the early 1990s.

Thesis work

My thesis work focuses on the relation between markup, document structure and genre. I want to investigate how markup can be used for automatic clustering of documents according to genre, where the notion of genre refers to certain formal artefactual characteristics deeply rooted in discursively determined communication and documentation tasks. The methods used take account of typified patterns of natural language use as well as of the structural constituents of the documents.

The method can be described as a two-step process. I first use machine learning techniques to disambiguate the markup; the output is a document representation consisting of a set of representations for each element in a document together with a representation of some features of the document as a unit. Clustering techniques based on the disambiguated markup are then used for genre identification. Since genres are presupposed to be dynamic phenomena, subject to constant change, and the number of possible genres changes as well, clustering is preferred over classification.
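The two-step process can be illustrated with a minimal Python sketch. It is not the thesis implementation: the 1-nearest-neighbour role assignment, the three element features, the threshold-based single-link clustering, the toy training data and all names (`element_features`, `disambiguate`, `represent`, `cluster`) are assumptions made only for this example. The sketch also simplifies the document representation to role shares plus one document-level feature.

```python
from collections import Counter
from math import dist

ROLES = ("heading", "paragraph", "emphasis")

def element_features(tag, text):
    """Numeric features for one marked-up element (an illustrative choice)."""
    return (
        float(len(text.split())),                      # length in words
        1.0 if text.rstrip().endswith(".") else 0.0,   # sentence-like ending
        1.0 if tag == "b" else 0.0,                    # ambiguous presentational tag
    )

# Hypothetical hand-labelled training elements: (tag, text, role).
TRAINING = [
    ("b", "Research interests", "heading"),
    ("b", "Thesis work", "heading"),
    ("p", "My thesis concerns markup and genre in electronic documents.", "paragraph"),
    ("p", "Clustering is preferred because genres change over time.", "paragraph"),
    ("b", "not", "emphasis"),
    ("b", "very", "emphasis"),
]

def disambiguate(tag, text):
    """Step 1: assign a structural role by 1-nearest-neighbour lookup."""
    target = element_features(tag, text)
    nearest = min(TRAINING, key=lambda ex: dist(target, element_features(ex[0], ex[1])))
    return nearest[2]

def represent(document):
    """Document vector: share of each role plus a scaled mean element length."""
    roles = Counter(disambiguate(tag, text) for tag, text in document)
    n = len(document)
    shares = [roles[role] / n for role in ROLES]
    mean_len = sum(len(text.split()) for _, text in document) / n
    return tuple(shares + [mean_len / 10.0])  # crude scaling, assumed

def cluster(vectors, threshold):
    """Step 2: single-link clustering; the number of clusters is not fixed."""
    labels = list(range(len(vectors)))  # every document starts alone
    for i in range(len(vectors)):
        for j in range(i + 1, len(vectors)):
            if dist(vectors[i], vectors[j]) <= threshold:
                old, new = labels[j], labels[i]
                labels = [new if lab == old else lab for lab in labels]
    return labels

# Toy documents as lists of (tag, text) elements.
docs = [
    [("b", "Research interests"),
     ("p", "I study document structure and genre in electronic texts."),
     ("p", "Markup languages have been part of my teaching for years.")],
    [("b", "Thesis work"),
     ("p", "The method is a two-step process based on machine learning."),
     ("p", "Clustering techniques are then used for genre identification.")],
    [("b", "Home page"), ("b", "Course list"),
     ("b", "Paper list"), ("b", "Contact details")],
]
labels = cluster([represent(d) for d in docs], threshold=0.5)
print(labels)  # -> [0, 0, 2]
```

In this toy run the two prose-like pages end up in one cluster and the menu-like page in another. Because clusters are formed by a distance threshold rather than a fixed set of labels, a new kind of document can form a new cluster without retraining, which is the motivation for preferring clustering over classification.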

Formal affiliations

Since September 2002 I have been a PhD student at the Swedish School of Library and Information Science at Högskolan i Borås and at the Swedish National Graduate School of Language Technology. My main supervisor is Tor Henriksen, and my additional supervisor is Barbara Gawronska at Högskolan i Skövde.

PhD courses attended

Tillämpad vetenskapsteori [Applied Theory of Sciences] (5 credits) - SSLIS course spring 2004, directed by Jonas Larsson.

Information Access (5 credits) - GSLT course fall 2003, directed by Barbara Gawronska.

Machine Learning (5 credits) - GSLT course fall 2003, directed by Joakim Nivre.

Multi-paradigm programming in Oz (5 credits) - GSLT course spring 2003, directed by Torbjörn Lager.

Theory Development (10 credits) - SSLIS course spring 2003, directed by Lars Höglund and Diane Sonnenwald.

Linguistic Resources (5 credits) - GSLT course fall 2002, directed by Lars Borin.

Natural Language Processing (5 credits) - GSLT course fall 2002, directed by Joakim Nivre.

Vår tids filosofi [Philosophy of Our Time] (10 credits) - SSLIS course 1998, directed by Michael Azar.

Course papers (a selection)

Gunnarsson, M. (2004). The Metaphysics of Library and Information Science. (A paper for a PhD course on Applied Theory of Sciences).

Gunnarsson, M. (2004). From Information to Documents and Back Again. (A paper for a PhD course on Information Access).

Ekeklint, Susanne, & Gunnarsson, M. (2004). Dependency Grammars and Memory-Based Learning. (A paper for a PhD course on Machine Learning).

Gunnarsson, M. (2003). Toward a Theory of Document Architectures. (A paper for a PhD course on Theory Development).

Bjarnadóttir, Kristín, & Gunnarsson, M. (2003). Copyright and the Web as Corpus. (A paper for a PhD course on Linguistic Resources).

Other papers and textual work

Hansson, Joacim, Mats Dahlström, Helena Francke & Mikael Gunnarsson (2003). Documents in Library and Information Science – Sociotechnical dimensions in document genre and architecture studies. Paper presented at The Document Academy Annual Meeting '03, Berkeley, CA, August 13-15, 2003. [Preliminary version]

Gunnarsson, M., Lingefjärd, T., Mekki-Berada, T., & Sjöblom, C-A. (2002). Flexibelt lärande - lärande examination. Göteborgs universitet. (UFL-rapport 2002:1).

Gunnarsson, M. & Dahlström, M. (2000). DA draws a circle: on document architecture and its relation to library and information science education and research. Information Research, Vol. 5, No. 2.

Gunnarsson, M. & Dahlström, M. (1999). On Document Architecture. (Paper presented at The 3rd British Nordic Conference on Library and Information Science, April 1999).

Gunnarsson, M. & Dahlström, M. (1999). Hypertext '97: Southampton - rapport från konferensen. Human IT, no. 2.

Gunnarsson, M. (1998). Webben som en text (The Web as a Text). Human IT, no. 3.

Gunnarsson, M. (1998). En rapport från EP98, St. Malo, 1-3 april 1998. Human IT, no. 2.