Thursday, 30 October 2008

CFP: International UDC Seminar 2009 "Classification at a Crossroads", The Hague, 29-30 October 2009

Following the success of the first International Seminar on UDC, the second in a series of biennial conferences entitled Classification at a Crossroads: Multiple Directions to Usability will take place on 29-30 October 2009 in the UDC headquarters at the Koninklijke Bibliotheek in The Hague.

The 2009 Seminar aims at exploring how new developments in information standards and technology influence and affect applications and services using classification, Universal Decimal Classification in particular, and its relationships to other systems.

The Seminar programme will highlight many ways in which the use classification can be improved. Attention will be paid to the applications of classification in supporting multilingual access, user-friendly representations of classification in resource discovery and semantic searching expansion and classification application across distributed systems.

Papers are now invited on the following topics:
    Classification and semantic technologies, e.g. experiences with vocabulary standards for expressing and porting classification data into the Semantic Web, vocabulary registries, terminology services
    Classification in supporting information integration, e.g. classification use in alignment of vocabularies, classification as a common subject language in co-operative systems, experiences in multi-database systems, classification mapping to other subject languages, classification enhancement with social tagging
    Verbal and multilingual access to classification, e.g. textual searching and display, management of subject-alphabetical indexes, extraction of thesauri from classification schemes
    Classification authority control and library systems, e.g. issues with MARC formats, authority file development, maintenance and sharing of data
    Visual representations/interface to classification, e.g. issues in classification browsing and faceted representation in classification tools and information systems
    Experiences with classification outside the traditional library environment, e.g. use in different types of digital repositories (eprints, VLE), resource discovery on the Web, alerting services, specialised bibliographic services and databases (images, sound), organization of physical objects (museums, archives)
To read more about conference theme and to submit your abstract visit the event's website.

Wednesday, 1 October 2008

Document form in UDC - what about Website/Webpage

Recent discussion on the udc-forum discussion list pointed to the fact that we do need a concept of website/webpage as a simple form number in UDC Table Id - Common Auxiliaries of Form.

The specific question was whether it was satisfactory to denote 'webpage' with a form number for digital documents with alphabetical extensions for HTML:

(0.034.2HTML) Webpage

and whether we can class e.g. Scottish Parliament website as follows:

328.1(410.5)(0.034.2HTML) Parliament - Scotland - Webpage

Which raised question whether it was better to combine document form with the main number for website:

004.738.1 Site. Service node. Sites by type of service

Thus enabling us to say more precisely e.g.

328.1(410.5)(0.034.2:004.738.1) Parliament - Scotland - Website content
and

328.1(410.5)(0.034.2PDF:004.738.1) Parliament - Scotland - PDF document on the website

or

(0.034.2JPG:004.738.1) Digital document - JPG image - Website

In relation to the above Miguel Benito commented:
"I am not still satisfied with the proposals of using the auxiliary (0.034.2) for web pages. For me the auxiliaries (0.0...) are complementary auxiliaries to other form auxiliaries. Even if they can be used separately by themselves we should be very restrictive in this.
We have to find a solution giving web pages a own form auxiliary between (01) and (09). The problem is that the subdivision of the auxiliary (08) where it should be more appropiate is used for a lot of forms while the other numbers as (01) and (02) are used very little and have very few subdivisions.
Maybe (00), or (001) could be a new form subdivision for electronic materials >with posibilities of subdivisions in the future.
"

The Form table in UDC is very important and it is well worth making the effort to put this right. If we try to place the new concept in the table there are some things to take into consideration.

These are top classes in the form table (with 362 sub-classes).

(0.0...) Physical features, etc.
(01) Bibliographies
(02) Books in general
(03) Reference works
(04) Non-serial separates. Separata
(05) Serial publications. Periodicals
(06) Publications of societies, organizations
(07) Documents for instructions, teaching, study, training
(08) Collected, polygraphic works. Forms. Lists. Illustrations. Business publications
(09) Historical form. Legal and historical sources

Looking into the scope of top classes and knowing what is underneath class (0.0...) documents by physical features seems to me the only appropriate place.
However, this is not about 'slotting' notation for webpage as document form numbers need to meet the following requirements:
    1. express attributes/characteristics of the inner form of presentation such as (091) historical, (092) biographical form
    2. distinguish primary, secondary and tertiary sources (documents, bibliographies, encyclopaedias)
    3. distinguish monographic publications and periodicals
    4. express document types by different criteria: by purpose, by type of outer presentation (text, sound, image still, image moving)
    5. express carrier and give a full taxonomy of document carriers: printed, digital etc.

    but most importantly

    6. allow free and unlimited combination of all above (1, 2, 3, 4, 5) as every document can be described by attributes from all first five facets.
It is this 6th requirement that makes the whole task more complicated on the plan of classification notation.

Specifically when it comes to webpages/websites we may need to say encyclopaedia (038) that is on the web, periodicals (05) on the web, (02) book on the web and of course different digital files (pdf, doc, dataset) on the web. So we have to pay attention to the fact that website or webpage is not a document form as such but rather a way in which a document in any form can be published and that this class number will have to be combined with everything that is already in Table Id - Common Auxiliaries of Form.

To express combinations in notation UDC has the following mechanisms

a) parallel division i.e. taking the beginning part of the first number and second part of the second number and amalgamate them together e.g. substituting = in =111 English language with 821 Literatures of individual languages to express English literature 821.111.
This amalgamation produces shorter numbers but it poses problems as it disguises the fact that the number is combined of two concepts both of which we have to manage and access. Hence parallel divisions are something UDC revision tries to eradicate as much as and whenever possible.
In form numbers we have this principle applied in (0.05) documents for a certain kind of audience. To express intended audience of the document the form number has to be 'extended' by numbers for persons.
E.g. -053.2 children should be amalgamated with (0.05) like so
(0.053.2) Documents for children
or
(0.056.262) Documents for partially sighted persons, blind persons [using -056.262]

Here again we have parallel division principle of amalgamation which causes that e.g number for children -053.2 or for blind persons -056.262 cannot be easily discerned, managed or searched for. The option is to change this to (0.05-053.2) and we are now considering of introducing this change in the E&C 30.

So in principle UDC numbers should be expressive with respect to syntax and numbers that are combined should look so. We have three options in achieving this:

b) introducing attributes/characteristics that may be shared by all document form as special auxiliary table, most preferably -1/-9 - this way every attribute we add will begin with - dash

c) combine two form numbers with : colon within parenthesis like so
(05:0.034.44) Periodicals - CD ROM

d) when it comes to form auxiliaries which are enclosed in () it is possible and possibly even desirable that they are listed as a sequence of distinct numbers e.g.
39(05)(0.034.44) Ethnography - periodicals - CD ROM

So when introducing number for webpage/website we have to think through all these scenarios and see which one works the best.

Last but not least - we have to make sure that we maintain a distinction between website/webpage as a subject of study in 004 Computer science as distinct from website/webpage as form of publishing.

Needless to say comments are welcome.