On document architecture and its relation to LIS education and research

The 3rd British Nordic Conference on Library and Information Science, April 1999.

Mats Dahlström and Mikael Gunnarsson
[preprint version]

Throughout the ages, one of the main interests of librarians and of LIS research has been different types of referential and bibliographical databases. Bibliographic databases (including catalogues and indexing & abstracting services of different kinds) have been studied as compound meta-objects 1 providing pointers to and descriptions of other objects of primary interest to the end-user. Characteristics of these meta-objects have been critical factors in determining the value of their use. Some of these characteristics may be metaphorically described as belonging to the architecture of meta-objects, which in practice is usually expressed in the "hierarchical" structuring of databases into records, fields and subfields or the "relational" structuring of databases into entities and relations. The objects that they point to - the textual containers - have by no means been given the same attention, except for those characteristics that are relevant in predefined cataloguing practice and, in the case of citation indices, those that link scientific works together. At least, this is true for the LIS domain in Sweden. Moreover, in Knowledge Organization, the preoccupation is with the rules and methods for classification and indexing. It seems as though the objects of "input" are being reduced to their mere possible inherent meanings. The basic material form and the factual, as well as the more abstract, textual structure of the objects have, to some degree, been regarded as transparent and self-evident, almost axiomatic starting points, offering little or no room for problematization and discussion. In this paper we argue for a re-evaluation of this somewhat limiting perspective, which we see as an in itself reasonable outcome of the centuries-long hegemony of print technology.
As print technology is increasingly becoming just one alternative alongside modern technology, and as the latter is on its way to reframing the ways in which meta-objects might be construed, we want to emphasize the practice of document studies, hitherto largely neglected in LIS education and research. These studies must be conducted free from media chauvinism, and may be realized in such subfields as the production, management, delivery, organization, history, theory, materialities and sociology of documents as well as, of prime interest to this particular paper, document architecture (DA). In this respect we may also relate to a number of disciplines - e.g. literary theory and bibliography - where basic assumptions have been questioned regarding the supposed basic atomic unit in the architecture of large document corpora, notably the codex book or the journal issue. Digital carriers for bodies of text can only with great difficulty, and quite often misleadingly, be handled by traditional, print-based institutions. A brief consideration of digital documents vs systems such as legal deposit or descriptive cataloguing might by itself suggest our great need for new fields of research, education and course syllabi in LIS schools 2. Studies of digital production and distribution might also prove to be of value in re-theorizing technologies for reading and writing and their conditioning of textual carriers. This crossover may yield a reassessment of document architecture in its own right, and might thus make way for a rebirth of a sub-discipline largely disregarded in our part of the world. But let us first dig somewhat deeper into the very concept of DA.

The notion of document architecture

By document architecture (DA) we mean several things implied by the metaphor of architecture, besides the formal definition of ISO 8879:1986, which reads "Rules for the formulation of text processing applications" (ISO 8879:1986 clause 4.97).
We believe that, just as the architecture of a building discloses a lot about, for example, the architect, his skill, the architectural style and its underlying convictions about what makes a building functional and aesthetic, DA discloses a lot about the practices and underlying theory of the production of the document. DA as a concept is closely connected to a certain kind of "text processing applications", general markup languages (GML), where content is separated from presentation. This structural strategy for digital representation is contrary to strategies where documents are represented and processed with respect to their appearance: their positioning on the screen or on a sheet of paper. The latter perspective makes no distinction between a picture and a sequence of alphanumeric characters. The former takes account of and stores unambiguous information on a document's structural elements, and is exemplified by SGML, XML and originally even HTML. 3 This strategy may also contradict traditional text production strategies, where the writer tends to focus on appearance. Our observations indicate that students learning HTML have extreme difficulties in understanding this concept of separation. 4

In the context of general markup languages the concept of DA serves the purpose of signifying every more or less compelling system of rules that "deals with the document's purpose, its audience, its possible media, and the variety of ways that the document will be played, performed, displayed, accessed, transmitted, or read" (Turner, 1996, p. 10). Thus a bundle of instructions on how to render a document on a screen (such as a style sheet), as well as on how packets of metadata may be inserted into the document, is part of the DA. However, as all these rules depend on more or less temporary conceptions of how we might adequately produce documents and on technological constraints 5, the study of DA has a much broader scope than the purely technical bits.
The metaphor may then be said to apply both to how these ideal models of text production work and become expressed in different media, both digital and analog, as structures of different document types, and to how these models (or styles) form taxonomies of different but related styles. DA may then be studied as a matter of how to build and, from the perspective of aesthetics, as investigations into different styles for different epochs, cultures and genres. And why not in the context of a "sociology of documents", as John Seely Brown and Paul Duguid seem to suppose?

"To fully assess the document's evolving role requires a broad understanding of both old and new documents /…/ They are also a powerful resource for constructing and negotiating social space." (Brown & Duguid, 1996)

With this conceptual background of DA, let us now turn to three contexts where DA might be a tool for fuller understanding: information seeking, digital (re)production, and theories of technology.

Document architecture and its importance for information seeking

Information seeking is often described as a "constructive process characterized by uncertainty and confusion", where an "information search is a learning process" (Kuhlthau, 1994, pp. 8-9). Thus the description seems to advocate a perspective on research and education based on user perspectives, leaving out the system-oriented perspectives of the so-called "bibliographic paradigm" (Ibid., p. 1). There is no denying the fact that information seeking is always performed as actions highly dependent on personally situated and contextual factors, but it is our view that if the impact of a changing technology on information seeking is to be determined, this must include an analysis of the relationships between DA and document representation - an analysis that calls for questions usually attributed to the domains of indexing and cataloguing, but extends their scope radically. If we accept that analyses of information seeking missions may sometimes be characterized as investigations into how articulations of the needs of the users relate to the way documents are represented in different types of meta-objects, then we are also allowed to study the interplay of DA, document representation and user in an information seeking context. As far as we know, this kind of study has hitherto not been dealt with explicitly to any satisfying degree in LIS domains. This is not the forum to elaborate further upon this, merely to point out how different DAs are comprehended by different actors, and to hope for increased interest in these questions.
Further empirical investigations into these matters would be welcomed, as we base our presentation here solely on our acquaintance with the web and our observations of how students deal with it.

Examples of how architectural changes affect information seeking

In learning to use the web, students in general have to get rid of several misunderstandings of the underlying techniques. Pollock & Hockley (1997), in a 1995 study of PC-literate users conducting searches on the web, came to the conclusion that "users need at least some understanding of basic Internet concepts in order to carry out successful searches". This study shows, among other things, how users fail to understand that their needs must be articulated according to how technology makes up a search language. Among LIS students the awareness of this is increasing, but the situation is still too often likened to bibliographical database searching. There seems to be a strong tendency towards being as precise as possible in the hope that search engines will present final answers. It certainly would be better if search services expressed "search results as 'suggestions' rather than 'hits'" (Ibid.). The misconceptions of how web searches may be performed are fairly natural, as bibliographic systems have always been important for the LIS field and are emphasized in most courses on information seeking, where the main question at hand seems to be how to find relevant textual objects and discriminate irrelevant ones. In the following we will emphasize some architectural conditions of web documents that affect the use of search engines and other search services on the web. One example will be thoroughly elaborated, viz. how the concept of the title of a document plays an important role in a known-item search.

Different worlds - different titles

A known-item search occurs when the user has a limited but correct description of an existing document.
The user is sure of the fact that the document exists, that its title and author are explicitly stated somewhere in the document, and these assumptions are true to the actual state of the docuverse. This situation is fairly different from when a user searches for unknown documents. 6 A known-item search in a bibliographical database is highly supported by the predictable, traditional DA. Almost every book has a stated title, responsibility, editor etc. Different codes for document description, as for instance the AACR2, prescribe that a representation of a document must include a statement of a document's title and author. It is obvious that, to some degree, conventions for producing documents have been adapted to facilitate such descriptions for the producer. The CIP is the most obvious example, where a bibliographical description is inserted into the document in a prescribed position. Thus documents making use of the CIP are instances of a document type that defines a position as well as a form of expression for metadata 7 conforming to library practice. These metadata are extracted, in accordance with cataloguing rules, from other parts of the document, e.g. the title page, which in its turn is another element of the conventional DA of a book. Consequently, the user searching for a known book may rely on the fact that what he or she thinks is the title of a book is also its title with respect to its representation in bibliographical databases. Exceptions are of course not too rare, due to orthographic errors and the like, but there are apparent correspondences between rules for the generation of document representations in bibliographical databases and common sense apprehensions.

The user carrying out a search on the web, in Alta Vista, Infoseek or some other "search engine", cannot rely on such a correspondence - for several reasons. For example, according to Gudivada et al. (1997), it has been estimated that 20 % of objects on the web lack the title tag.
Admittedly, formal HTML specifications do prescribe the declaration of a title, but since current web browsers seem more or less to ignore whether the HTML object conforms to formal rules or not, and since the producer of a web document does not always recognize the importance of significant titles, the result is that many web documents nevertheless lack a title or have titles that make no sense. The former problem may become less frequent with a transition from HTML to XML, as the latter imposes stricter regulations on the producers of browsers (XML processors). This may happen if XML processors, as is suggested (Goldfarb, 1998), won't accept ill-formed documents, which means that an XML object that is not encoded in conformance with the rules will not be rendered at all and consequently will be unusable. With respect to this possible evolution it may be said that XML architecture imposes further constraints on the writer and seems limiting, but at the same time makes expressions of this architecture more predictable, for generators of document representations as well as for readers.

In the meantime it is a fact that the user of a search engine, especially an LIS student, expects that a search term qualified by title: 8 will match all documents with that term in the title. The fact is, though, that as the (formal) title is that part of the text manifested in the upper bar of the window, both readers and writers tend to regard the first heading or another visually prominent part as the title. Neglecting the meaning of the title tag may also cause the writer to omit it. The mismatch between the architectural definition of a title and the user's is obvious, and may be of critical importance, especially when the writer has represented the title with an in-line image, as images aren't indexed at all by search engines.
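The mismatch between the formal title and the visually prominent heading can be made concrete with a minimal sketch. It uses Python's standard html.parser module; the class name TitleCandidates, the function title_candidates and the sample pages are hypothetical and serve only as illustration. They do not describe how Alta Vista, Infoseek or any other search engine actually indexes documents.

```python
from html.parser import HTMLParser


class TitleCandidates(HTMLParser):
    """Collect the formal <title> and the first <h1> of an HTML page."""

    def __init__(self):
        super().__init__()
        self.formal_title = None   # text of the <title> element, if present
        self.first_heading = None  # text of the first <h1>, if present
        self._current = None       # tag whose text is currently collected
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        # Only the first occurrence of each element is of interest here.
        if tag == "title" and self.formal_title is None:
            self._current, self._buffer = "title", []
        elif tag == "h1" and self.first_heading is None:
            self._current, self._buffer = "h1", []

    def handle_data(self, data):
        if self._current is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == self._current:
            # Normalize whitespace in the collected text.
            text = " ".join("".join(self._buffer).split())
            if tag == "title":
                self.formal_title = text
            else:
                self.first_heading = text
            self._current = None


def title_candidates(html):
    """Return (formal title, first heading) for a page given as a string."""
    parser = TitleCandidates()
    parser.feed(html)
    parser.close()
    return parser.formal_title, parser.first_heading


# Hypothetical page: the formal title says little, the heading says a lot.
page = (
    "<html><head><title>Untitled</title></head>"
    "<body><h1>Annual Report 1998</h1><p>Text</p></body></html>"
)

# Hypothetical page: no title element, and the heading is an in-line image.
image_page = "<html><body><h1><img src='logo.gif'></h1></body></html>"
```

For the first page, title_candidates(page) yields the formal title "Untitled" next to the heading "Annual Report 1998", so a search restricted to the formal title would miss the term the reader regards as the title. For the second page there is no formal title at all, and the heading contains no text that could be indexed.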
As cataloguing codes are beginning to adapt to new media in order to incorporate pointers to resources on the web in library catalogues, rules for determining the title of web documents are being formulated. This is, as far as we can see, not done in a way that takes account of already existing (technical) definitions of the title or the unique architectures of web documents. In fact the ALA in a report last spring came to the conclusion that existing schemes for metadata (TEI and the Dublin Core) are of limited use in catalogue integration. It all seems to be a tentative strategy to treat web documents as expressions of the (inherently different) architecture of journal articles, books or other types of printed documents. This strategy often concludes that the title is the visually most prominent part of the web object's rendition, usually a declared heading or the text of a GIF or JPEG image, in analogy with AACR2 rules for book description, which (naturally) make references to visual appearance. This may lead to a situation where not only do search engines and users diverge in their definitions of the title, in their models of what constitutes the architecture of a web site, but also where cataloguing rules develop yet another conception of the DA of web objects.

So if we say that different worlds take account of different titles, we may also say this about other architectural phenomena. In a recent article on "citation in the digital era", Mats G. Lindquist remarks that "libraries are tied to the book and consider journals to be fragmented books" (Lindquist, 1999). This statement may be compared to the remark of Luciano Canfora that "[f]or librarians, the scroll was the 'unit of measurement'", implying that estimations of the contents of the Alexandrian library in most sources were highly exaggerated, since they were derived from "the practice of counting not works but scrolls" (Canfora, 1990, p. 189).

Since 1994 we have seen that users may open files, documents, URLs and sites in the File menu of the browsers. At the moment we seem to be opening pages (in Netscape), though the word page here merely resembles what is usually meant by that word. Abundant examples bear witness to the fact that, as metaphors for a new technology are derived from older information technologies, this may cause unnecessary ambiguity 9. The concept of document is in itself somewhat of a metaphor, in that it signifies a way of establishing borders for a coherent whole, by reifying a conception of a collection of entities into a coherent work, a document. Consequently, we believe that all this points to the relevance of a DA perspective, as it is important to elucidate the unique nature of the outcomes of a modern technology.

If we understand technology in its broadest sense, not restricted to modern technology, we may approach its essence in much the same way as Heidegger in The Question Concerning Technology (1977), where technology is described by traditional "instrumental" and "anthropological" definitions. Technology, then, may be seen as "a means to an end" where "human activity" takes place in "bringing-forth". Thus writing and reading may be seen as activities for bringing forth human expressions - thereby using tools, which put necessary constraints on the activities. In these activities there is no way of becoming independent of technology. Language as well as inscribing tools (like the pencil or the keyboard) and storage media form a technology for writing and reading, a technology in which we "dwell", rather than one we use as a simple tool. Thus, if we accept this view of technology as something that transforms, as well as is transformed by, mankind, any change in technology will affect the possibilities of expression, and this must be regarded as a critical factor in analyzing information seeking.

Document architecture as a tool for understanding the production and construction of literary works

This view of technology might be of some assistance to us when considering the importance of DA in the study and theory of markup languages as tools that shape, as well as are shaped by, our conception of possible DAs. As noted above, the term DA even originates in the realm of markup practices.

Markup as a theory of DA

DA becomes important when trying to understand metagrammars or schemes of how textual works are and can be constructed, most apparently so when we deal with the aforementioned markup languages for either the direct production of new digital works or the digitization of works previously represented in paper documents. As Michael Sperberg-McQueen notes (1991), a markup of a text 10 is in fact a theory, or a theoretical statement, of this text. Extending this view, then, a general markup language is a general theory of texts. Note that Sperberg-McQueen speaks of "texts". It is perhaps more appropriate, though, to apply his statement not to "texts" 11, but rather to documents, and specifically to the architecture of documents, as a markup language is a tool for structurally handling aspects and elements on different hierarchical and conceptual levels of the document, not only textual strings at the linguistic level. In this way we might consider certain families of markup languages as theories of particular types of documents or of genres, and as statements as to how these documents or groups of documents can in fact be constructed at all, and, finally and importantly, as tools perhaps shaping the very production of documents and their architectures. Consider the universally intended grammar of SGML. SGML is not a markup language, but might instead be regarded as a metagrammar: rules for how actual markup languages in turn might be constructed.
In practice, user communities collaborate and discuss what types of elements are needed for certain types of documents and their markup (in particular when digitizing older literary works), and a formal Document Type Definition (DTD) is agreed upon. This process has to include the troublesome definition of, and consequent agreement on, what the important inherent elements in the documents at hand are, i.e. an agreement on what constitutes the documents 12. This DTD has to be followed in order for documents to be universally exchangeable and processable within the intended user group. SGML was from its beginnings bound to the notion that DAs in general were characterized by ordered hierarchies of content objects, textual blocks that could not overlap. This was, in a manner of speaking, the nature of documents according to this theoretical view. Attempts were even made to establish theories of the ontological status of texts (or rather: documents) as consisting of content objects in a strict hierarchical order (DeRose et al., 1990; summed up and elaborated in Renear, 1997). This underlying philosophy of SGML defined and constrained the development of secondary applicational grammars - markup languages - such as TEI and HTML, thereby restricting these tools to a particular concept of DA. It soon turned out, however, that there in fact existed a number of document types that did contain overlapping hierarchies 13, contrary to the general theory of SGML, and that therefore could be marked up according to an SGML DTD only with considerable loss of adequacy. Where does that leave the "general" in General Markup Languages? The explosive growth of the web and the consequent widespread use of HTML and its tools, along with more or less explicit demands for well-formed (as regards the validity of the encoding) HTML documents, have resulted in equal growth and spread of this architectural conception of SGML.
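The constraint that content objects may nest but not overlap can be observed directly in a parser. The following is a minimal sketch that uses Python's standard XML parser as a stand-in for an SGML parser, which is an assumption of this example; the function name is_well_formed and the element names scene, line and speech are hypothetical and are not taken from TEI or any other DTD.

```python
import xml.etree.ElementTree as ET


def is_well_formed(markup):
    """Return True if the string parses as well-formed XML, else False."""
    try:
        ET.fromstring(markup)
    except ET.ParseError:
        return False
    return True


# Hypothetical drama fragment in which the speech fits inside one verse line,
# so the elements form a strict hierarchy.
nested = "<scene><line><speech>To be, or not to be</speech></line></scene>"

# The same words, but the speech starts in one verse line and ends in the
# next, so the speech element and the line elements overlap.
overlapping = (
    "<scene><line><speech>To be,</line>"
    "<line>or not to be</speech></line></scene>"
)
```

Here is_well_formed(nested) returns True, while is_well_formed(overlapping) returns False: the parser rejects the second fragment as a whole. An encoder who wants to keep both the verse structure and the speech structure therefore has to give one of them up or express it by some indirect means, which is the loss of adequacy mentioned above.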
The architectural view upon which SGML and HTML were originally based might thereby justify itself in a, as it were, roundabout way. The technological tool, the thing with which to write, will affect the thing written. The particular DA conception in SGML and derived languages will affect the production, collection, definition and analysis of documents, both as documents are originally produced electronically, and as paper-bound works are being digitized 14, and might consequently be of major interest to LIS.

A musical interlude

There is a useful analogy to be obtained from the history of music and the notation of music, begun in the 19th century 15. During Romanticism there began a movement to notate (and thereby to preserve) the music that hadn't so far been written down, notably folk music. The notation was performed according to current notational schemes (cf. our markup languages for "notating" literary texts) and current musical theory. These schemes and theories, however, neglected unorthodox elements such as quartertones, and consequently did not offer notes to textualize these phenomena, however important they in fact were in folk music. The notation had to be adjusted to the tools (and to the general theory of music) at hand at the time. Note also that the notators at the time did in fact regard quartertones and the like as false or improper, and so were convinced that they notated the music in an adequate way, correcting improper anomalies of the works. These tools - and their concept of the possible "architecture" of the notation of musical works - have thus shaped the way we are able today to recreate these works of art.
Following the development of musical notation, it is also interesting to note a) in what way less and less information has been attributed to each separate note, whereby more and more information has been attributed to the interplay or the syntax of the notes, b) how the development of instruments during the 19th and 20th centuries has been adjusted to the conditions of the notational tools and for instance the intervals allowed by the scheme, and c) the social organizing of musical life (education and training, choir and orchestra organization etc) as formed by the systems being developed, resulting, among other things, in each individual musician in an orchestra having narrower and more distinct duties, while the whole performance of the works tended to grow more and more complex. Polyphonic music as shaped during the last century, Sinding-Larsen concludes, would not have been possible to develop without the notational tools and their conditions.

What is to be drawn from this musical divertissement? Again, that the tools for description and the general theory of the ontological status of particular works of art shape the way we actually are able to produce and reproduce the works. With this in mind, we might begin to analyze in what way text application tools such as word processing software or markup languages condition how we are able to produce and reproduce (when digitizing) works, and in what way they constitute statements on possible DAs, as well as how the tools will affect e.g. literary genres and the social organizing of literary life. For the 19th century notators, part of their mission when notating music was to disclose the inherent essence of music, in much the same way as the developers of (general) markup languages are now trying to disclose the inherent essence of texts and documents.
In order to be able to adequately handle possible future document production and architectures, it is of utmost importance in LIS education and research to closely follow these attempts and perhaps also to be part of them.

Document architecture as an object of inquiry into information technology

The use of traditional IR systems tends to place modern technology as a barrier to products of another technology. Modern technology is treated as a mere tool for retrieving representations of information residing in a framework of print technology. This may be the most fruitful way of interacting with information systems in everyday practice. However, this apprehension may be of less use when learning modern information technology as such. Mastering information technology, for example word processing, is not the same as having an understanding of it. It is possible to learn word processing and in the process acquire little or no understanding of how technology makes word processing possible. This results in problems in some cases, as in the exchange of documents between two successive versions of the same word processing software or when changing to another word processing software. The transition from WordPerfect to MS Word has for most users not been an easy one, even though there are apparent underlying similarities between their respective architectures.

It is an interesting question to ask why mastering the techniques of particular software and hardware solutions isn't enough, and whether it really isn't. We do not believe that it is, and the answer may be approached from different perspectives. One of the reasons is the fact that modern information and communication technology conceals a course of events that takes place in the performance of certain tasks. Hidden under sophisticated and supposedly user-friendly interfaces are rather primitive but numerous events.
It would be possible to say that this essence in modern technology tends to alienate its users, as was a frequent argument during the 1960s and 1970s, when computers came to be symbols for "big brother" 16. Be that an exaggeration or not, in the context of education this nature of concealment is certainly problematic. If Bruner's statement that "the heart of the education process consists of providing aids and dialogues for translating experience into more powerful systems of notation and ordering" (Bruner, 1966, p. 21) is worth considering, and we believe it is, then a major task will be to unveil the concealed courses of events in a way that facilitates the experience of technological foundations. The reason for students' difficulties in understanding concepts such as file, server or network may be that they are, essentially, elaborated abstractions. Consequently all those terms turn out to be abstractions of unexperienced phenomena. The teacher then finds him- or herself in a position where s/he has to tell beautiful stories that the student will hopefully believe and remember, and/or instruct the student on how to perform a discrete and purely instrumental task.

There is no true magic inherent in technology. It's just that all hardware and software are built on a great number of concealed abstractions. Several of these abstractions are founded on different theories on the nature of language and knowledge, as many investigations show. 17 Is it possible that if the underlying assumptions of, for example, representation techniques could be experienced and studied in a situated task, a thorough understanding would become possible? Word processing is a common task that every student is expected to carry out. Problems with students' motivation and lack of interest, so often observed in other courses, are often absent when it comes to learning word processing. There is simply an obvious relation to necessary tasks.
Word processing as a means to technology education should therefore seem like a good starting point for understanding technology. However, it is a fact that problems encountered during these tasks are specific to particular products. Even though several tasks in for example MS Word will be performed similarly to those in other word processors, they are seldom identical. The task of text alignment or line breaking in general cannot be learned by the use of one word processing application alone. Another example is the insertion of in-line images, which can be done in several ways, described by software designers in ambiguous dialog boxes of the interface. How is the first-time user expected to recognize the difference, and its consequences, between storing the image in the document and linking to an image, when both ways seem to produce the same effect? Technology and the solution of tasks tend to stay intimately intertwined, which makes it almost impossible to decontextualize problem solving and make way for learning, or, in the words of Bruner again, technology hinders "independence of response from the immediate nature of the stimulus" (Bruner, 1966, p. 5).

A study of explicit DAs in relation to a task like this one would possibly be a remedy to this restrained growth. As mentioned in the beginning of this paper, DA deals with "rules for the formulation of text processing applications", but rules are neither accessible nor explicit in the context of word processing applications. Such rules are in fact found among the general markup languages (GML) like SGML, XML and HTML, where technology may be stripped bare of its sophisticated interfaces. GMLs reveal a lot more of what is going on "inside". Users frequently delight in seeing that a simple string of short text may cause an in-line image to appear in the text, or that another string ties an object to a different web site by a blue underlining.
The same applies to changes made in a separate style sheet, which may affect the colour and font of all first-level headings in a large, complex web site. This is like playing with technology, not just using it as a means to an end. 18

The need for DA implementation

There are numerous practices within LIS education and research where DA studies might prove to be an essential tool for gaining knowledge. A few instances have been addressed in this paper, but given time and space, we might of course broaden our scope to further areas of LIS, for instance: a) the ever growing need for attention to quality aspects and for critical analysis of web contents and forms, and b) the overwhelming new situation in digital environments, where the malleable screen presentation form of an object is separated from its storage form. In both cases an understanding of the fundamental DA is crucial.

A large number of traditional institutions and systems in library environments are encountering obstacles in the digital world on account of their being essentially print-based. The established systems can be "bent" to accommodate the qualities of digital documents, or the other way around. The obstacles can thus temporarily be bypassed through ad hoc arrangements, but they will not be adequately handled unless solutions and theories are generated on the basis of a thorough understanding and analysis of the architectural essence of documents in general and, in the examples stated, of web environments and digital objects in particular. This makes way for an architectural understanding of different kinds of media, structures and meta-objects.

LIS education and research ought therefore, in our opinion, to implement the field of DA in order to perform the necessary investigations of existing and future developments of documents and meta-objects. Students would be better prepared. LIS itself would be better prepared.

Footnotes

1. The term meta-object is borrowed from William Arms (1997) and his colleagues' description of an "architecture for information in digital libraries". The term is for our part meant to indicate that many objects that serve as pointers to library material are not really bibliographic.
2. The reassessment needed pertains, of course, largely to the challenged concept of document, a concept "rooted in hundreds of years of tradition, planted firmly in enormous and complex systems for publishing, organization, and access. Yet it has become increasingly evident that the archetypal concept of 'document' as 'book' underlying these systems is insufficient to deal with a multitude of media formats, particularly diverse electronic formats (...)". (Schamber, 1996, p. 669)
3. It may be important to point out that even though the SGML standard dates back to 1986, work on markup languages based on separation of content and presentation dates back to the late 1960s, according to one of SGML's developers, Charles Goldfarb (1998).
4. Daniel Chandler (1995) attempts a thorough description and classification of writing strategies, where his "architectural strategy" is most akin to that implied by general markup languages.
5. Something Allen Renear (1997) and C M Sperberg-McQueen (1991), i.a., have pointed out.
6. This is roughly equivalent to Jennifer Rowley's distinction between "known-item searching" and "browsing". (Rowley, 1992, pp. 5-6)
7. What has been said about meta-object may also apply to this now widely used and often misunderstood term. Meta-objects contain metadata, but this is not always bibliographical data.
8. An attribution possible when using several search engines, e.g. Alta Vista and Infoseek.
9. Of course, metaphors such as these are necessary elements of the architecture, as (along with navigation systems, indexing etc) "the glue that holds together a web site". (Rosenfeld & Morville, 1998, p. 11)
10. That is: the attribution, to the linguistic elements of a text, of structural (and perhaps also presentationally intended) metainformation contained in "tags".
11. If by "text" we refer to the sequence(s) of linguistic alphanumerical signs stored in and / or manifested on a physical carrier, a carrier which, together with its text(s), constitutes a document.
12. A task that might prove to be difficult indeed. When digitizing paper-bound works and their textual manifestations, we frequently have difficulty identifying, explaining, and explicitly tagging a great number of features in such works, as we have grown so accustomed to them that they have become more or less transparent.
13. For instance printed drama, see Huitfeldt, 1999.
14. Interestingly enough, digitization has offered opportunities for reframing the contents and architectures of manuscripts and printed books, forcing the digitizing agents to express, code and group digitized reproductions of not only the documents as entities, but also parts of the documents, whereby documents and parts of documents can be handled in numerous ways, e.g. restructuring, searching, downloading, printing (again!) and commercializing (The Center for Retrospective Digitization, 1999; Klaproth & Lossau, 1998). This makes way, in a manner of speaking, for a more literal version of the infamous deconstruction of works, but this time a deconstruction not of the inherent meanings in works, but of the direct textual expression as manifested in the physical documents.
15. This paragraph owes greatly to the work of Sinding-Larsen (1988). See also Huitfeldt, 1999.
16. At another end of this chain is the practice of commercial software producers to "protect" from the user some patented layers of manipulation, thereby, in a sense, rendering the user impotent. This practice has been analyzed and criticized by literary theorists such as Friedrich Kittler (1994).
17. See for example Richard Coyne's (1995) thorough exposé on the theoretical foundations for computational design or the works of Joseph Weizenbaum (1984), Terry Winograd and Fernando Flores (Winograd & Flores, 1986).
18. Which, peculiarly enough, is something that is not allowed in some of our public libraries, where actions taken with the help of computers must be good for something else or, as they put it, be a task of searching ("söka").

References

(All web references checked March 31, 1999.)

Arms, William Y., Blanchi, Christophe & Overly, Edward A., (1997), "An Architecture for Information in Digital Libraries". // D-Lib Magazine. - February.
Brown, John Seely, & Duguid, Paul, (1996), "The Social Life of Documents". // First Monday. - Vol. 1 : 1.
Bruner, Jerome S., (1966), Toward a Theory of Instruction. - Harvard UP.
Canfora, Luciano, (1990), The vanished library. - Univ. of Calif. Press.
The Center for Retrospective Digitization, Göttingen State and University Library, (1999). - Göttinger DigitalisierungsZentrum, Niedersächsische Staats- und Universitätsbibliothek Göttingen.
Chandler, Daniel, (1995), The Act of Writing : A Media Theory Approach. - Prifysgol Cymru (University of Wales).
Coyne, Richard, (1995), Designing Information Technology in the Postmodern Age. - MIT Press.
DeRose, Steven J., Durand, David G., Mylonas, Elli & Renear, Allen H., (1990), "What Is Text, Really?". // Journal of Computing in Higher Education. - Vol. 1 : 2 (winter), 3-26.
Goldfarb, Charles, (1998), The XML Handbook. - Prentice Hall.
Gudivada, Venkat N. et al, (1997), "Information Retrieval On The World Wide Web". // IEEE Internet Computing. - Vol. 1 : 5, 58-68.
Heidegger, Martin, (1977), The Question Concerning Technology and other essays. - Harper & Row.
Huitfeldt, Claus, (1999), "Tekstkoding og tekststrukturer" // Datahåndbok for humanister / Ed. Espen Aarseth. - Ad Notam, Gyldendal. - 123-146.
Kittler, Friedrich, (1994), "Protected Mode" // Computer als Medium / Friedrich Kittler, Norbert Bolz & Christoph Tholen. - Fink. - 209-220.
Lindquist, Mats G., (1999), "Not Your Father’s References : Citations in the Digital Space". // Journal of Electronic Publishing. - Vol. 4 : 3.
Pollock, Annabel, & Hockley, Andrew, (1997), "What's Wrong with Internet Searching". // D-Lib Magazine. - Vol. 3 : 3.
Renear, Allen, (1997), "Three (Meta)Theories of Textuality" // Electronic Text : Investigations in Method and Theory / Ed. Kathryn Sutherland. - Clarendon. - 107-126.
Rosenfeld, Louis & Morville, Peter, (1998), Information Architecture for the World Wide Web. - O'Reilly.
Rowley, Jennifer, (1992), Organizing Knowledge. - Aldershot.
Schamber, Linda, (1996), "What Is a Document? Rethinking the Concept in Uneasy Times". // Journal of the American Society for Information Science. - Vol. 47 : 669-671.
Simone, Raffale, (1995), "The Body of the Text" // The Future of the book / Ed. Geoffrey Nunberg. - UCLA Press. - 239-251.
Sinding-Larsen, Henrik, (1988), "Notation and Music : the History of a Tool of Description and Its Domain to be Described" // Artificial Intelligence and Language / Ed. Henrik Sinding-Larsen. - Tano. - 90-114.
Sperberg-McQueen, C.M., (1991), "Text in the Electronic Age : Textual Study and Text Encoding, with Examples from Medieval Texts". // Literary and Linguistic Computing. - Vol. 6 : 1, 34-46.
Turner, Ronald C., Douglass, Timothy A., & Turner, Audrey J., (1996), Readme.1st : SGML for Writers and Editors. - Prentice Hall.
Weizenbaum, Joseph, (1984), Computer Power and Human Reason. - Penguin.
Winograd, Terry, & Flores, Fernando, (1986), Understanding computers and cognition. - Ablex.
This web version created April 1st, 1999.