<message id="<2pvk41$35c@styx.uwa.edu.au>" date="2976764481">
Newsgroups: comp.text.sgml
Date: 01 May 1994 07:01:21 UT
From: James Tauber \<jtauber@tartarus.uwa.edu.au>
Organization: The University of Western Australia
Message-ID: <2pvk41$35c@styx.uwa.edu.au>
Subject: Classical Greek Data Content Notation?

I recently designed a classical Greek font that I'd like to make ISO
compliant with regard to coding.

I am aware of the Greek data content notation (where passages of Greek are
tagged \<gk>) so far as it is a transliteration system for the actual Greek
letters themselves but I'm interested if there is a standard for the
diacritics, breathing marks, etc., when they appear in long passages of
classical Greek.

If there is an ISO standard for the Romanization of Classical Greek, would
someone be so kind as to let me know how breathing, etc., is done?

Thanks
--
James K. Tauber, Undergraduate Student
Centre for Linguistics, UWA, Australia
E-mail: jtauber@tartarus.uwa.edu.au
WWW:    ftp://tartarus.uwa.edu.au/pub/jtauber/main.html
</message>
<message id="<19940501.2011@naggum.no>" date="2976794030">
Newsgroups: comp.text.sgml
Date: 01 May 1994 15:13:50 UT
From: Erik Naggum \<erik@naggum.no>
Organization: Naggum Software; +47 2295 0313
Message-ID: <19940501.2011@naggum.no>
References: <24700@dog.cme.nist.gov> \<Cp0oLs.Cpt@hawnews.watson.ibm.com>
Subject: Re: sgml entities

[Christoph Altenhofen]

|   (Hope you have misspelled the entity-call only in your posting.  The
|   semicolon after \&Company.Name is missing, because it isn't followed by
|   a RE, but a comma.)

note that the semi-colon is _not_ required when the character following
the entity reference could not be part of the reference.  the actual text
of the standard reads [353:3]:

    The /refc/ or /RE/ can be omitted only if the reference is not
    followed by a character that could occur in the reference, or
    by a character that could be interpreted as the omitted
    _reference end_.

\</E>
--
Erik Naggum \<erik@naggum.no> \<SGML@ifi.uio.no>  |  memento, terrigena.
ISO 8879 SGML, ISO 10744 HyTime, ISO 10646 UCS  |  memento, vita brevis.
</message>
<message id="<19940501.2012@naggum.no>" date="2976794710">
Newsgroups: comp.text.sgml
Date: 01 May 1994 15:25:10 UT
From: Erik Naggum \<erik@naggum.no>
Organization: Naggum Software; +47 2295 0313
Message-ID: <19940501.2012@naggum.no>
References: <24700@dog.cme.nist.gov> <1994Apr29.220917.1558@xgml.com>
Subject: Re: sgml entities

[Eric R. Skinner]

|   If I may comment on the overall approach you are taking, rather than
|   the specifics of the SGML interpretation, I'd like to suggest that
|   using entities for boilerplate text is not the only technique
|   available.
|
|   We'd argue that having to change your entity declarations each time
|   your boilerplate value changes is a bit of a hack, depending on the
|   scale of your system.

it depends on how easily you can redefine entities.  if you have to edit
the document to do it, I would agree.  this is your situation if you buy
any of the major SGML parsers.  if instead you use a parser that supports
IMPLICIT LINK, you can override the entity in the link type declaration:

    \<!DOCTYPE boiler [
    \<!ENTITY company PI "ERROR select a LINK to define `company'">
    \<!ELEMENT boiler (#PCDATA)>
    ]>
    \<!LINKTYPE acme boiler #IMPLIED [
    \<!ENTITY company "Acme, Inc">
    ]>
    \<boiler>
    This company, \&company, cannot be held responsible for anything.
    \</boiler>

\<ESOTERICA>
this could look like a prime candidate for SIMPLE LINK, but the complex
wording in 12.1.4.3 appears to prohibit entity declarations in simple link
type declarations subsets.  this must be one of those minor mistakes that
review processes are for.  SIMPLE LINK is indeed simple, especially for the
implementor, whereas IMPLICIT is somewhat harder.  few have implemented
either, so I suggest we allow SIMPLE LINK to behave as if it were better
defined:

    \<!LINKTYPE acme #SIMPLE #IMPLIED [
    \<!ENTITY company "Acme, Inc">
    ]>

this would allow the same simple link to be used with a larger set of
documents that all refer to the company by entity reference.  this may be
useful if you work for a company that changes its name frequently, such as
many publishers.

a further "complication" to some would be that LINKTYPEs need to occur
between the document type declaration and document instance.  this is a
result of weak entity managers that are unable to extract substrings of
files, but can also be rectified by smart parsers who, upon finding the end
of the prolog, continue to read a set of LINKTYPE declarations from another
file, conceptually part of the same entity.
\</ESOTERICA>

in any case, what we have here is an elegant mechanism to redefine entities
according to processing needs, which also makes it possible to list all the
relevant processing dependencies in the source document, to be selected at
run-time by invoking the linktype.  the use of a processing instruction to
signal an error makes the default value hard to ignore.

\</E>
--
Erik Naggum \<erik@naggum.no> \<SGML@ifi.uio.no>  |  memento, terrigena.
ISO 8879 SGML, ISO 10744 HyTime, ISO 10646 UCS  |  memento, vita brevis.
</message>
<message id="<1994May1.200944.20383@ponds.uucp>" date="2976811784">
Newsgroups: comp.text.sgml,comp.text
Date: 01 May 1994 20:09:44 UT
From: Gary Merrill \<ghm@ponds.uucp>
Message-ID: <1994May1.200944.20383@ponds.uucp>
References: <19940427.082224.679@almaden.ibm.com>
Subject: Re: Single Text, Many Uses

[Michael Priestley]

|   Actually, I think the chunks would end up being at the paragraph level
|   or even smaller; potentially even at the word level, as Gary Merrill
|   suggests, which could prove processor-intensive for anything but very
|   short documents.  Given this, I think SGML has very little to add to
|   the process.  It's still useful in other ways, of course, but most of
|   the SGML tags would be specific to a single medium, and so wouldn't
|   actually be "single-sourced" at all.  For example:
:
|   In these examples, the SGML tags are medium-specific, because the
|   organization of information is medium-specific.  If any DBMS system can
|   format this on the fly, and display any particular medium or "version"
|   and allow me to change it (eg I change Chunk A in medium 1), and
|   automatically bring up Medium 2 at the place(s) where Chunk A is used
|   there, so I can assess the effect of my change in its various contexts,
|   I'd find it darned useful.  If it operates on OS/2, I'll buy it now!
|   
|   But I don't think this is a solution, in itself, that owes much to
|   SGML.  Information is being single-sourced, but its structure isn't;
|   and as I understand it, SGML tags the structure of information.  As has
|   been previously suggested, perhaps this thread doesn't belong on
|   comp.text.sgml anymore, and it should settle in comp.text.
:
|   I hope I've clarified the idea.  I think this is what Gary Merrill and
|   others have been discussing (I could be wrong), and their discussions
|   might make more sense to you now (if what I've said made any sense).

I think that this is a good presentation of the sort of potential
complexity and the possible pitfalls with which I was concerned.  And
again, I think it demonstrates the general and fundamental problems that
are faced in this area -- ones that SGML will not solve simply by being
SGML, and ones which likewise any *other* representation must also grapple
with.

Gary Merrill
--
"No reading, no inquiry has yet been able to remove my difficulty, or give
me satisfaction in a matter of such importance.  Can I do better than
propose the difficulty to the public, even though, perhaps, I have small
hopes of obtaining a solution?  We shall, at least, by this means, be
sensible of our ignorance, if we do not augment our knowledge."  (David
Hume)
</message>
<message id="<2q2eoo$7tb@news.csie.nctu.edu.tw>" date="2976857304">
Newsgroups: comp.text.sgml
Date: 02 May 1994 08:48:24 UT
From: Yue-ming Lan \<csymlan@saturn.yzit.edu.tw>
Organization: ITRC
Message-ID: <2q2eoo$7tb@news.csie.nctu.edu.tw>
Subject: SGML->LaTex->EPS

Dear SGMLers,

I was told to find a solution to convert SGML documents to EPS, I would
like to ask dear netters, how is to best way to achieved this.

My idea is converting SGML to LaTex and then to EPS.  I was told there is
an utility does this, where to find it?  Can it process double byte
character?  (I mean Chinese or Japanese)?  Is there a Japanese version of
SGML2Latex converter?

Many thanks in advance.

Yue-ming Lan

csymlan@saturn.yzit.edu.tw
</message>
<message id="<1994May2.125347.14369@news.lrz-muenchen.de>" date="2976872027">
Newsgroups: comp.databases,comp.text.sgml,comp.infosystems,comp.databases.theory,comp.databases.object
Date: 02 May 1994 12:53:47 UT
From: Martin Josko \<martin@pollux.edv.agrar.tu-muenchen.de>
Organization: Leibniz-Rechenzentrum, Muenchen (Germany)
Message-ID: <1994May2.125347.14369@news.lrz-muenchen.de>
Subject: SGML->DSSSL(or others)->SQL(or others)?

I'm interested at anything concerning the conversion from SGML-Documents
into database languages (e.g., described in SQL, OODB-DDLs?) with the help
of any standards (e.g., DSSSL) or any software (e.g., SINDA) or any other
theoretical approaches.

Does somebody have some hints or comments to this theme?

--
Martin Josko             Technical University Munich
TU-Muenchen              Department of Mathematics
DVS-Weihenstephan        Statistics and Data Processing
D-85350 Freising         Freising Germany

email: martin@pollux.edv.agrar.tu-muenchen.de
phone: +49-(0)8161-71-4506
fax:   +49-(0)8161-71-4409
</message>
<message id="<9405021720.AA02508@pws09.aisg.com>" date="2976888003">
Newsgroups: comp.text.sgml
Date: 02 May 1994 17:20:03 UT
From: Mary Kroeger \<mkroeger@aisg.com>
Message-ID: <9405021720.AA02508@pws09.aisg.com>
Subject: NC SGML-User's Group

March 5th 1994, a small group of young, talented friends gathered to
discuss the possibilities of forming an SGML-User's Group.  We yearned to
discover more about SGML-supported software, how other corporations use
SGML, and how we could make a difference in the future develoment of SGML
Standards.  This group of friends, now officially, the NC SGML-User's Group
meets once a month to exchange ideas, information and insights into not
only the SGML world, but also HyTime, DSSSL, and MathPack.  The NC
SGML-User's Group consists of the most skilled and knowledgeable Document
Analysts, SGML editors, SGML proofers, and Project Supervisors employed by
Gateway Conversion Technologies as well as Technical Writers from other RTP
companies.

On May 7th, the NC SGML-User's Group would like to open its doors to all
individuals interested in attending an SGML information exchange.  This
meeting's agenda will include presentations by the HyTime and MathPack
committees, formed by members of the SGML-User's Group.  The meeting will
take place 7:00 p.m. at Gateway Conversion Technologies, located in
Morrisville, North Carolina.  Gateway Conversion Technologies provides
companies with the latest technology in manipulating and communicating
information electronically.  Supported electronic formats include raster
images, ASCII, SGML, PostScript, and publishing formats such as Interleaf,
FrameMaker, WordPerfect, and Microsoft Word.

For further information please call Mary Kroeger or Suzy Wharam at
+1 919 319 6500.  Or send e-mail to \<mkroeger@aisg.com> or \<suzy@aisg.com>.
We look forward to meeting you!

--
Mary Kroeger
mkroeger@aisg.com
Document Analyst
Gateway Conversion Technologies
</message>
<message id="<840241720D5.61R.123516000@gbm.ocunix.on.ca>" date="2976893810">
Newsgroups: comp.text.sgml
Date: 02 May 1994 18:56:50 UT
From: Andrew Webber \<webber@gbm.ocunix.on.ca>
Organization: Games by Mail, Ottawa, Ontario
Message-ID: <840241720D5.61R.123516000@gbm.ocunix.on.ca>
Subject: Corel Ventura SGML?

I haven't seen this discussed here yet; does anyone know what the "SGML
support" in the upcoming Corel Ventura 5 will include?

The software won't be shipping for another couple of months, but the
upgrade order form arrived on Friday.  One of the utilities listed is
"TagWrite" which I assume is related to SGML.  I wonder whether it will
be an "SGML-to-Ventura" process similar to Miles 33's approach, a
"Ventura-to-SGML" conversion, or both.

Andrew
</message>
<message id="<Cp8HHM.132@watdragon.uwaterloo.ca>" date="2976969226">
Newsgroups: comp.databases,comp.text.sgml,comp.infosystems,comp.databases.theory,comp.databases.object
Date: 03 May 1994 15:53:46 UT
From: Pekka Kilpelainen \<pkilpela@watsol.uwaterloo.ca>
Organization: University of Waterloo
Message-ID: \<Cp8HHM.132@watdragon.uwaterloo.ca>
References: <1994May2.125347.14369@news.lrz-muenchen.de>
Subject: Re: SGML->DSSSL(or others)->SQL(or others)?

[Martin Josko]

|   I'm interested at anything concerning the conversion from
|   SGML-Documents into database languages (e.g., described in SQL,
|   OODB-DDLs?) with the help of any standards (e.g., DSSSL) or any
|   software (e.g., SINDA) or any other theoretical approaches.

There is a paper entitled

    Christophides, Abiteboul, Cluet and Scholl:
    "From structured documents to novel query facilities"

to be presented at the forthcoming PODS/SIGMOD'94.

From the abstract: "This paper describes a mapping from SGML documents into
an OODB and a formal extension of an OODB query language in order to deal
with SGML documents storage and retrieval."

The authors' e-mail address: {Vassilis.Christophides, Serge.Abiteboul,
Sophie.Cluet, Michael.Scholl}@inria.fr.

--
Pekka Kilpelainen	Univ. of Waterloo, Dept. of Computer Science
			pkilpela@watsol.uwaterloo.ca	(519) 885-1211 x6229
</message>
<message id="<michael_will-030594121302@m71007.esl.com>" date="2976984782">
Newsgroups: comp.text.sgml
Date: 03 May 1994 20:13:02 UT
From: Michael Will \<michael_will@smtp.esl.com>
Message-ID: \<michael_will-030594121302@m71007.esl.com>
References: <767060935.AA00364@bcnz.ac.nz>
Subject: Re: Single Text, Many Uses (was Re: Real, true uses of SGML)

[Errol Thompson]

|   I beleive that the traditional prose for printed material is not a good
|   approach to implementing online information.  However there are
|   techniques for analysing information that lead to different styles of
|   presentation in both the printed and online media.  One such method is
|   Information Mapping.
:
|   What I am endeavouring to say is that the approach to analysing the
|   information becomes more critical when you shift from print media to
|   online but that a good analysis approach such as Information Mapping
|   may resolve those difficulties.

Could you provide reference(s) to "Information Mapping" analysis technique?
Thanks in advance.
</message>
<message id="<2q7at5$bp3@netprod1.aisg.com>" date="2977017189">
Newsgroups: comp.text.sgml
Date: 04 May 1994 05:13:09 UT
From: Sam Stevens \<sstevens@aisg.com>
Organization: Gateway Conversion Technologies
Message-ID: <2q7at5$bp3@netprod1.aisg.com>
Subject: FOSI

FOSI?  Wasn't he that guy on "Happy Days" with the leather jacket and the
attitude?  No?  Well, you'd better start at the beginning.

As a book designer, I have always provided my clients with a SPEC SHEET.
Now they want a FOSI.  So if you have _any_ info, your karma would benefit
by your sending it (the information) to me.  Thanks.

--
=========Sam Stevens==sstevens@aisg.com================================================
I do not represent Gateway. I'm not sure I'm even allowed in that part of the building.
=======================================================================================
</message>
<message id="<2DC7EB42@noak.vxo.telub.se>" date="2977031640">
Newsgroups: comp.text.sgml
Date: 04 May 1994 09:14:00 UT
From: Peter Bergstrom \<pebe@telub.se>
Message-ID: <2DC7EB42@noak.vxo.telub.se>
Subject: Again: HyTime links in practice

Hi, once again!

Since my first posting on this subject, I've been re-writing much of the
HyTime code for FMV Grund-DTD, due to Eliot Kimbers very useful reply.  I
have also gone through the HyTime standard in detail once more, and sorted
a lot of things out.  I'm sorry it took me so long to get back, but it took
some time...

I've also tried to follow the other threads that sprung up from my post,
and it has been very interesting.  Basically, I understand Carla's, Gary's
and others standpoints, and I'm not fanatic about what we are trying to do
(I hope).  Eliot Kimber pointed out that sometimes compromises have to be
done, and gave the IBMIDDoc as an example of a good compromise, and I
believe we're trying to solve basically the same problem here.  I also
believe Wayne Wohler made a very good conclusion, and gave us all some
things to think about.

However, I'd like to go back to Eliots first reply and try to clarify a few
things, and ask some new questions, still believing that I'm not the only
one struggling with the HyTime standard and it's sometimes cryptic
explanations.

[Eliot Kimber]

|   In a HyTime application, elements that don't have any specific HyTime
|   function are normally declared as either form "HyBrid" or form
|   "sHyTime".  HyBrid-form elements are simply able to contain other
|   HyTime forms, while sHyTime-form (suppress HyTime) elements suppress
|   any HyTime processing within them (note that the form sHyTime is a
|   renaming of the form nHyTime as defined in the Catalog of HyTime
|   Architectural Forms, available on the SGML Archiver server).

Which form should I use, nHyTime or sHyTime?  Is it an error to use one or
the other?  What is used by for example HyMinder?

[Regarding elements "include-data", "include-nonSGML" and "reference":]

|   I may have missed something, but it's not clear how this element serves
|   to include things, unless it's to cause the inclusion of the element
|   that contains it (in which case, why not point to that element
|   directly).
:
|   I would expect this element to have an ENTITY attribute that names the
|   data entity actually being included.
:
|   I wonder if the descriptions of the anchors haven't been confused with
|   the links to those anchors, hence my confusion?

I'd better clarify:

Those three elements are used *only* when you want to refer to or include
something.  They are simply HyTime anchors for an independent link, and
they are always the source anchor.

These elements are located at the exact point of reference or inclusion,
just as if a context link (clink) would have been used instead.  We don't
want the linking information in the modules, and therefore don't use clinks
at all.  You are right, they don't *do* anything!  Their names are maybe
misleading to you and others who understands the underlaying syntax, but we
believe the names are useful to authors (but the SGML comments I made for
these elements were incorrect, since they were directed to authors, not
SGML/HyTime specialists).

I think I'll explain this even more, since I don't believe it was very
clear in my earlier posting.

We envisage placing all links in one, and only one, module for the entire
system.  In all information modules, there will only be anchors, and they
will know nothing about links at all.  The "link module" will always act as
the hub document, since it contains all links (and entity declarations for
that matter).  Even if I only want one single module I need to start from
the link module in order to link in graphics, tool names, etc, which are
all linked through HyTime independent links, found in the link module,
i.e., the hub document.

One could argue that we could use several link modules, maybe even one for
each information module, but we see a very great benefit of using only one
link module: The link module could, in an application such as the FMV
storage system, be modeled with STEP/Express and converted into a data
base, i.e., a good part of the data base structure would be given, and we
would have a good start for a document management system.  The HyTime
syntax, and the link module, would be created from this data base at
export.  They are only necessary for interchange.  And then, they describe
all links between the modules and elements in a unambiguous way by the use
of Hytime.  Of course, an application could also be built around the link
module, and use a HyTime engine to manage all links.

Another benefit of using one single link module is that we don't have to
duplicate Entity references and namelocs/nmlists pointing at the same
module/ information.

We understand that this concept might not be the best for the production of
data, but we are designing a storage DTD, and there is nothing that stops
producers from creating the information in accordance with a modified DTD
that allows HyTime links inside information modules, if only they deliver
in according to this DTD.

[Regarding attribute "datatype" on the endterm element:]

|   Nice use of endterm.  Note that the datatype attribute should be
|   unneeded because the object located will indicate its type as part of
|   the data entity declaration.  By the same token, elements carry their
|   own type information, which the processing application doesn't need
|   until it actually gets the data (it doesn't need formatting information
|   before it has the stuff it's going to format).

I didn't explain the use of "datatype" clear enough, I'm sorry.  In the
information modules, we don't have any elements named \<tool>, \<sparepart>,
\<consumable> or alike, where the user have to enclose the name of the
tool/sparepart/consumable with the appropriate element.  Instead, we want
all tool names, sparepart names and id's, etc., to be included into the
module from a common storage, e.g., a "tool-module" or an LSAR data base.
This is done by using the anchor element "include-data", which is only used
for this purpose.

There are, especially among consumables, frequent changes of the names
and/or id's.  Today, references to consumables are often made to the safety
precautions document, rather than to the FMV id for it, just because this
FMV id changes with every brand of kerosine, or whatever.

What we want to achieve by this is that you link to a generic tool (or
specific, depending on your needs at that specific point in your module),
and whenever the id or name changes, the change is made in one place.  Now,
if one anchor point is \<include-data>, what is the other?  It could be a
bibliographic reference, since the object is a real object that is not
stored in the computer.  This would be a logical solution, but not very
efficient (or?).  We envisage the other anchor as being one element in a
"material-module" or "techdata-module".  That element have a sub-element
structure, which includes a lot of information regarding the tool,
consumable, etc.  My problem here is that I want to select only some of
that sub-element structure, and I don't know what parts of it I'm
interested in when writing the module.

Let me explain:
In the infomodule I have this:
	Clean the parts carefully, using \<include-data id="xyz">.

In the "techdata-module" we have the following:
      \<consumable id=g12>              \<!-- One type of consumable -->
        \<name>Kerosene                 \<!-- FMV generic name -->
        \<safetyprec id=g13>            \<!-- Doc ref (new link anchor) -->
        \<alt>                          \<!-- One alternative brand -->
          \<prodname>X12-Kerosene       \<!-- Product name -->
          \<fmv-id>F1234-567890         \<!-- FMV ID -->
          \<orig-id>KEM887-7654H-W      \<!-- Original ID -->
          \<alt-id>NOK56776-TY643       \<!-- Alternative ID -->
        \</alt>
        \<alt>                          \<!-- One alternative brand -->
           \<!-- Another brand of kerosene -->
        \</alt>
      \</consumable>

Now, in the printed document I don't want all different ID's to show up,
just the name and the reference to the safety precautions document (must be
retrieved from the safetyprec link, but that's another story).  In a screen
application, say, I want only the name as a "button".  When I place the
cursor above the button, I want a pop-up that shows the FMV-ID for all
alternatives, and when I click the button I want to traverse to the safety
precautions document.

OK, so what about attribute "datatype", then?

Basically it is up to the application to decide what information should be
presented when an inclusion of data is made, as above, but we feel that we
ought to give the author the possibility to *suggest* what information
should be presented.  If a second reference to kerosine is made three lines
further down, it might not be necessary to present as much information as
the first time, as an example.

The attribute "datatype" is declared to give that possibility of suggesting
how presentation of the data is going to be made.  It's a CDATA attribute,
but with (so far only implied) lextype=GI+.  The GI's should also be one
that exists inside the referenced element (e.g., "name" in the example
above).  (By the way, the name datatype is awful and doesn't say what it
does.  It will be changed.)

I hope this long explanation made some sense...

I'm wondering if this is the same thing as the HyTime attribute "reftype"
attempts to do?  I don't think I can simply change to reftype here, but
maybe reftype will do the job better for me?


|   \<!NOTATION InfoModule PUBLIC "-//Owner//DTD InfoModule Document type//EN" >
|   \<!ATTLIST #NOTATION InfoModule
|             HyTime NAME #FIXED "document"

You give the above example, but in analogy to similar declarations in
IBMIDDoc it might look like:

     \<!NOTATION InfoModule  PUBLIC
                "+//ISO/IEC 8879:1986//NOTATION SGML Document//EN" >
     \<!ATTLIST  #NOTATION InfoModule
                HyTime           NAME         #FIXED    "document" >

In the first example the public id is declared as a DTD, in the second as a
notation.  Is it even possible to define a Notation as a DTD?  Why not as
an SGML document, subdocument, or text entity, then?  What are the
implications of the two different ways to declare the public entity?  Any
recommendations?  My guess is that NOTATION must be used, at least when you
don't know which DTD the module (document) uses.

|   Note also that issues of document fragment re-use can be solved by
|   using subdocuments as the re-usable components of larger documents (I
|   explain this use of subdocuments in my forthcoming book, and you can
|   also find an earlier post of mine on the subject in the SGML archive).

I found an earlier posting of yours, a reply to Robin Cover entitled "Re:
SUBDOC and (SGML) ESIS".

What you say there is very interesting, and I hadn't thought of the
possible name conflicts with just including elements into modules. But
further questions arise:

-   Is it possible to link to an element inside an SGML entity as a SUBDOC
    (e.g. an element inside a module, not the entity as a whole)? We want
    to do this to create variants, for example.

-   How does HyTime unified name space help if the subdoc becomes part of a
    larger document? And how can a conref attribute have the same value
    whether the content is located in the same SGML entity (and name space)
    or if it is an external entity? In the first case is simply an IDREF,
    in the second it's an ENTITY. I can't even see that we can use the same
    attribute declaration.

|   Hope you find my comments useful.  You've helped my point exposing a
|   hole I had in my book (no discussion of multiple locations).

I certainly do!  I was hoping to get a response like this, but I didn't
dare believe in it.  I'm happy that my posting helped at your side, too.
That, I hadn't expected!

Peter
--
Peter Bergstr\&ouml;m           internet:  pebe@telub.se
Telub Inforum AB               phone:     +46 - 470 - 425 97
351 80  V\&Auml;XJ\&Ouml;        fax:       +46 - 470 - 397 96
SWEDEN
</message>
<message id="<m0pyhb5-0000zZC@uahcs2.cs.uah.edu>" date="2977049160">
Newsgroups: comp.text.sgml
Date: 04 May 1994 14:06:00 UT
From: Lori Snyder \<lsnyder@uahcs2.cs.uah.edu>
Message-ID: \<m0pyhb5-0000zZC@uahcs2.cs.uah.edu>
Subject: CONREF

I have a question about the default value CONREF.  I know that if the
attribute is specified, the element associated with it becomes an empty
element.  If a value is not specified, then the element will contain
whatever its content model is.

What I haven't figured out is the minimiztion of the end tag for the
element.  If I do not have minimization on the end tag, the 'SGMLS' parser
gives me a warning.  I've been told that Author/Editor and XGML Validator
do not issue a warning.

I didn't see anything about the minimization for this particular case in
the SGML books I have.  Can anyone enlighten me on this issue?

\</lori>

--
Lori Snyder		lsnyder@cs.uah.edu
Huntsville, AL
</message>
<message id="<199405041420.AA24030@Norway.EU.net>" date="2977049929">
Newsgroups: comp.text.sgml
Date: 04 May 1994 14:18:49 UT
From: Chet Ensign \<DOCCOE@ibivm.ibmmail.com>
Message-ID: <199405041420.AA24030@Norway.EU.net>
Subject: Next meeting of the SGML Forum of NY

The SGML Forum of New York invites you to attend

                     "Max-Factors for Successful SGML"

Date:   Monday, May 9, 1994, 5:30 p.m.
Place:  McGraw-Hill, Inc.
        50th Floor Executive dining rooms (elevators are across
             from the entrance)
        1221 Avenue of the Americas (lobby entrance off 49th Street)
Agenda: 5:30 Registration and Opening Remarks
        5:45 Keynote Speaker:  Robert J. Glushko, Passage Systems, Inc.
             "Making a successful migration to SGML"
        6:30 Refreshments
        7:00 Vendor Demonstration:  Robert J. Glushko,  PassagePRO

"A focus on tools alone is insufficient to ensure a successful project,"
states Bob Glushko; "What is needed for success with SGML is a framework in
which technology and process fit together."  At our May meeting, Bob will
present the framework and the key factors that every successful SGML
project must have.  He will describe the principles he has adapted in the
course of helping a number of companies make a successful migration to
electronic document delivery and SGML.  Bob says that he likes to make his
talks "an antidote for the usual technology and syntax-intensive
discussions."

For our vendor demonstration, Bob will show us PassagePRO, an SGML-based
document management and production system that supports the automated
conversion of information into SGML and the subsequent assembly of SGML
into hardcopy and online documents.


Upcoming Programs:

Our final meeting before the summer break will be held on June 14th in
conjunction with the Graphic Communication Association's Online '94
conference.  Evan Owens of the University of Chicago Press will tell us how
they implemented SGML.  David Silverman will talk about the services
offered by Data Conversion Laboratory, a leading SGML conversion service
bureau.  And Bob Hecht and Sybil Parker of McGraw-Hill will describe the
development of the Encyclopedia of Science and Technology, a multimedia
encyclopedia produced using SGML.


About the Forum

The SGML Forum of New York is a nonprofit organization devoted to the
exchange of ideas and information about SGML.  Organized primarily as a
user group, the Forum seeks to promote an understanding of the scope and
benefits of the SGML standard and to further its practical application
within a variety of industries, including publishing, financial services,
insurance, pharmaceuticals, and telecommunications.

Membership helps support the Forum's activities, but it is not required for
attending general meetings.

If you would like to receive word about future Forum meetings, send your
name and address to:

  SGML Forum of New York, Inc.
  Bowling Green Station
  P.O. Box 803
  New York, NY  10274-0803

We will be happy to put you on our mailing list.

Best regards,

/chet

--
Chet Ensign                             +1 212 736 6250 X4349
Information Builders, Inc.              internet: doccoe@ibivm.ibmmail.com
1250 Broadway                           ibmmail:  USUBUVMV@IBMMAIL
New York, NY  10001                     compuserve: 73163,1414
</message>
<message id="<2q8m1v$rdh@ulowell.uml.edu>" date="2977061375">
Newsgroups: comp.text.sgml
Date: 04 May 1994 17:29:35 UT
From: Lloyd Rutledge \<lrutledg@cs.uml.edu>
Organization: UMass-Lowell Computer Science
Message-ID: <2q8m1v$rdh@ulowell.uml.edu>
References: <2DC7EB42@noak.vxo.telub.se>
Keywords: HyTime, hyperlinking
Subject: Re: Again: HyTime links in practice

[Peter Bergstrom]

|   Which form should I use, nHyTime or sHyTime?  Is it an error to use one or
|   the other?  What is used by for example HyMinder?

The architectural form "nHyTime" was specified in the HyTime Draft
International Standard and was replaced by "sHyTime" in the final version
of the standard.  Thus, of course, you should use "sHyTime".

|   [ discusses "datatype" attribute which specifies GI of child of
|   identified element to present ]
|   I'm wondering if this is the same thing as the HyTime attribute
|   "reftype" attempts to do?  I don't think I can simply change to reftype
|   here, but maybe reftype will do the job better for me?

I don't think reftype is applicable here.  The attribute reftype specifies
what element types an UID reference is allowed to reference an instance of.
It is a HyTime common attribute, and they typically serve as what could be
loosely called "DTD extensions" -- that is, they specify what is allowed in
a document in ways the go beyond the restrictions a DTD can place.

Instead of stating what references are allowed, you want to specify as the
final reference a particular child of the element directly referenced.  I
would suggest looking at the architectural forms in the location address
module to do this trick.  Perhaps you could use a treeloc or a pathloc to
specify the child by the ordination of its appearence in the parent's
content. Or even better, use a HyQ query to access the child of the
referenced element that has that GI.

--
Lloyd Rutledge
Interactive Media Group
Computer Science Department
University of Massachusetts
One University Avenue
Lowell, MA 01854 USA
voice: 508-934-3554
fax:   508-452-4298
email: lrutledg@cs.uml.edu
</message>
<message id="<DMEGGINS.94May4133657@aix1.uottawa.ca>" date="2977061814">
Newsgroups: comp.text.sgml
Date: 04 May 1994 17:36:54 UT
From: David Megginson \<dmeggins@aix1.uottawa.ca>
Organization: Department of English, University of Ottawa
Message-ID: \<DMEGGINS.94May4133657@aix1.uottawa.ca>
Keywords: entities pubid
Summary: Are device-specific entities better than descriptive entities?
Subject: Using Public Identifier Versions

I have been doing some thinking about the version field in public
identifiers, and I wonder whether it isn't best to avoid it altogether.
For example, there are two models for handling the character entities when
converting an SGML document to LaTeX: in the first, simply use the standard
public identifiers like

  ISO 8879:1986//ENTITIES Added Latin 1//EN

replacing the entities with SDATA sequences like "[aacute]", then in the
postprocessor, convert those SDATA sequences to LaTeX sequences like
"{\\'a}".  The second approach is to make a LaTeX-specific version of the
entities, declared with the public identifier

  ISO 8879:1986//ENTITIES Added Latin 1//EN//LaTeX

where the entity file contains declarations like

  \<!ENTITY aacute CDATA "{\\'a}" -- a acute -->

instead of

  \<!ENTITY aacute SDATA "[aacute]" -- a acute -->

(With SGMLS, I could set SGML_PATH to something including
"/usr/lib/sgml/public/%O/%C/%V/%T", and keep the LaTeX-specific entities in
"/usr/lib/sgml/public/ISO_8879:1986/entities/LaTeX/Added_Latin_1" -- I
understand that the new version of SGMLS uses catalogues to simplify this
somewhat.)

I have read Goldfarb's brief comments on this point in the HANDBOOK, but he
does not seem to express a qualitative judgement on the two approaches.
The second seems good, since it allows the SGML parser itself to do most of
the work, using the entity manager for its intended purpose.

Personally, however, I think that the first is much better style -- granted
that the postprocessor ends up doing a lot of work which the entity manager
could have handled, but the document is no longer linked to a single
conversion format.  If I use the public identifier "ISO 8879:1986//ENTITIES
Added Latin 1//EN//LaTeX" in my document, I am declaring that this document
will be used for conversion to LaTeX and nothing else; on the other hand,
if I leave off the "//LaTeX", I have not committed myself, and can use the
document for anything from LaTeX to database or hypertext applications.

I would be interested to hear other people's views on this subject.

David

--
David Megginson                Department of English, University of Ottawa,
dmeggins@aix1.uottawa.ca       Ottawa, Ontario, CANADA  K1N 6N5
dmeggins@acadvm1.uottawa.ca    Phone: +1 613 564 6850 (Office)
ak117@freenet.carleton.ca             +1 613 564 9175 (FAX)
</message>
<message id="<19940504.2068@naggum.no>" date="2977081754">
Newsgroups: comp.text.sgml
Date: 04 May 1994 23:09:14 UT
From: Erik Naggum \<erik@naggum.no>
Organization: Naggum Software; +47 2295 0313
Message-ID: <19940504.2068@naggum.no>
References: \<m0pyhb5-0000zZC@uahcs2.cs.uah.edu>
Subject: Re: CONREF

[Lori Snyder]

|   What I haven't figured out is the minimiztion of the end tag for the
|   element.  If I do not have minimization on the end tag, the 'SGMLS'
|   parser gives me a warning.  I've been told that Author/Editor and XGML
|   Validator do not issue a warning.

|   I didn't see anything about the minimization for this particular case
|   in the SGML books I have.  Can anyone enlighten me on this issue?

Goldfarb, [408:20] reads

    11.2.2 Omitted Tag Minimization

    "O" should be specified for end-tag minimization if the element has a
    content reference attribute or a declared value of "EMPTY".

    Note -- Specifying "O" serves as a reminder that empty elements do not
    have end-tags (although this has nothing to do with markup
    minimization).

I don't necessarily agree with this suggestion.  if the end-tag is required
to be absent, it is not "omissible", and it is in fact confusing to have it
specified as such.  marking it as omissible also means you can't enforce
the end-tag when this is something you might specifically want to do.

</>
--
Erik Naggum \<erik@naggum.no> \<SGML@ifi.uio.no>  |  memento, terrigena.
ISO 8879 SGML, ISO 10744 HyTime, ISO 10646 UCS  |  memento, vita brevis.
</message>
<message id="<19940504.2069@naggum.no>" date="2977082348">
Newsgroups: comp.text.sgml
Date: 04 May 1994 23:19:08 UT
From: Erik Naggum \<erik@naggum.no>
Organization: Naggum Software; +47 2295 0313
Message-ID: <19940504.2069@naggum.no>
References: \<DMEGGINS.94May4133657@aix1.uottawa.ca>
Subject: Re: Using Public Identifier Versions

[David Megginson]

|   I have been doing some thinking about the version field in public
|   identifiers, and I wonder whether it isn't best to avoid it altogether.

I think it should be implemented the right way, instead.  I'd be surprised
if you _did_ like the results you have got, considering that you do it the
wrong way.

|   Personally, however, I think that the first is much better style --
|   granted that the postprocessor ends up doing a lot of work which the
|   entity manager could have handled, but the document is no longer linked
|   to a single conversion format.  If I use the public identifier "ISO
|   8879:1986//ENTITIES Added Latin 1//EN//LaTeX" in my document, I am
|   declaring that this document will be used for conversion to LaTeX and
|   nothing else; on the other hand, if I leave off the "//LaTeX", I have
|   not committed myself, and can use the document for anything from LaTeX
|   to database or hypertext applications.

good argument, and in fact arguing strongly in favor of what the standard
suggests (ISO 10.2.2.5; Goldfarb [390:9]):

    When a system accesses public text for which a public text for which
    a _public text display version_ could have been specified but was not,
    it must substitute the best available system-dependent version for the
    display device in use.  If there is none, no substitution occurs.

that is, leave it out of the document, and leave it to the SGML parser to
inform the entity manager to look for a public text display version.  that
is, you get _both_ of the good things you describe, and _none_ of the bad.

so, DO NOT USE public text display version in the document.  but DO USE IT
in the catalog, and obtain an entity manager that can deal with this very
useful aspect of character entity sets in SGML.  those that cannot are very
clearly deficient.  there is not even a choice in the matter.  the wording
above is "it must substitute the best available system-dependent version".

only if the SGML system itself is lacking such device-dependent entity sets
can it be excused of this requirement, but that is doing its users a major
disservice and forcing applications to do what the parser and the entity
manager _should_ be able to do, just as you point out.

however, all is not lost if you have a deficient entity manager, but you
_do_ have system-dependent versions of entity sets.  you can do the path
hack or the catalog hack and somehow specify which path or catalog to use
at runtime, but how about using LINK (provided you don't have a deficient
parser, as well :-)?

Link processing declarations can override entity declarations in the
document type to which it refers, and you can override entity declarations
to use the public text display version, under specific user control for the
individual, _named_ link processing declaration.  the name will have to be
specified at run-time, but that's no different from specifying the display
device to use, or the path or catalog to use.

since the display device is known to the application, and can more easily
be communicated to the parser than a link type with an unknown name (to the
parser, that is), using the public text display version is very clearly the
best of the available options.

by the way, I think using the public text display version for other things
than just "display versions" would be a tremendously good idea.  just call
it "version", and it may apply to document type declaration subsets that
are public text, and exist in various versions, but none without a version.
think of a NOTATION declaration which can be either a specification of the
notation, or a program or program fragment that validates or interprets the
notation.  (NOTATION is currently excluded from the kinds of public text
identifiers that can have it, but this can be fixed.)  a link processing
declaration subset may itself be device-dependent, and likely is, so it,
too, may be subject to this versioning scheme, not just character entity
sets.

the possibilities of an _implemented_ versioning scheme like this are
numerous, and the problems solved by it are so many and so obvious that one
is led to wonder why vendors have not implemented it.  my guess it that if
you have do "conversion" from [aacute] to whatever you really need, you
don't get around using their proprietary scripting language.  that's
probably why they haven't implemented LINK, either.

only when enough people know about these possibilities can a need for them
be felt in the market, and vendors be suggested to implemented what they
otherwise wouldn't want to.

</>
--
Erik Naggum \<erik@naggum.no> \<SGML@ifi.uio.no>  |  memento, terrigena.
ISO 8879 SGML, ISO 10744 HyTime, ISO 10646 UCS  |  memento, vita brevis.
</message>
<message id="<2q9o8f$qqs@news.delphi.com>" date="2977096399">
Newsgroups: comp.text.sgml
Date: 05 May 1994 03:13:19 UT
From: Kevin C Cole \<kccole@news.delphi.com>
Message-ID: <2q9o8f$qqs@news.delphi.com>
Keywords: sgml DOS
Subject: DOS-based SGML formatters?

Does anyone know of DOS-based software that will format SGML documents?

I haven't seen any since I wrote some doc for IBM about 10 years ago and
used their PC-GML formatter w/Sherpa.

I'd really appreciate any information.  Thanks.

--Kevin

kccole@delphi.com
</message>
<message id="<mh.768117224@batman>" date="2977106024">
Newsgroups: comp.text.sgml
Date: 05 May 1994 05:53:44 UT
From: Martin Hollender \<mh@batman.imat.maschinenbau.uni-kassel.de>
Organization: Free University of Berlin, Germany
Message-ID: \<mh.768117224@batman>
References: <767060935.AA00364@bcnz.ac.nz> \<michael_will-030594121302@m71007.esl.com>
Subject: Re: Single Text, Many Uses (was Re: Real, true uses of SGML)

[Michael Will]

|   Could you provide reference(s) to "Information Mapping" analysis
|   technique?  Thanks in advance.

Horn, R.E. (1989) Mapping Hypertext. The Lexington Institute, Lexington, MA
Contains a lot of further references to "Information Mapping"

Ciao Martin
--
Martin Hollender      Man-Machine Systems Lab    University of Kassel
D-34109 Kassel        Germany                    tel.  ++49-561-804-2701
email: mh@imat.maschinenbau.uni-kassel.de        fax.  ++49-561-804-3542
</message>
<message id="<2DC91527@noak.vxo.telub.se>" date="2977107120">
Newsgroups: comp.text.sgml
Date: 05 May 1994 06:12:00 UT
From: Peter Bergstrom \<pebe@telub.se>
Message-ID: <2DC91527@noak.vxo.telub.se>
Subject: HyTime prolog

Hi!

Here's a few questions concerning the prolog of a HyTime document.

As far as I can see from the HyTime standard, the APPINFO parameter should
be:

    APPINFO "HyTime"

But if I choose to rename (in my DTD) either the HyTime attribute itself,
or the HyName attribute, then the mapping of these should be included, say:

    APPINFO "HyTime  HyTime=My-HyTime-Name,My-HyName-Name"

Is this correct? Can somebody verify or correct what I stated above?


Regarding  the  HyTime Support   Declarations  (clause  6.3.2) I  have  the
following thoughts at this time:

Let's say I have the following Support Decl's:

    \<?HyTime VERSION "ISO/IEC 10744:1992" HYQCNT=32 >
    \<?HyTime MODULE base>
    \<?HyTime MODULE locs mixspace multloc spanloc >
    \<?HyTime MODULE links >

According to the standard they shall appear "in the document prolog, prior
to the base document type declaration". I can then see three possibilities:

1.  They can be placed as the first thing in the Declaration Subset of the
    DOCTYPE statement (which I believe is actually part of the base
    document type declaration):

        \<!DOCTYPE MyDoc SYSTEM "MyDoc.DTD" [
        \<?HyTime VERSION "ISO/IEC 10744:1992" HYQCNT=32 >
        \<?HyTime MODULE base>
        \<?HyTime MODULE locs mixspace multloc spanloc >
        \<?HyTime MODULE links >
        \<!-- Here can other declarations, like external entities come. -->
        ]>

2.  They can be placed in the SGML Declarations File, after the SGML
    Declaration, of which APPINFO is the last parameter. This is however a
    "grey zone" and I have no idea if any parser/HyTime engine will ever
    see the declarations here.

        \<!SGML  "ISO 8879:1986"
        -- SGML Declarations omitted here --
        APPINFO "HyTime"
        -- This is the end of SGML Declaration --
        >
        \<?HyTime VERSION "ISO/IEC 10744:1992" HYQCNT=32 >
        \<?HyTime MODULE base>
        \<?HyTime MODULE locs mixspace multloc spanloc >
        \<?HyTime MODULE links >

3.  They can be the very first thing of the DTD, but it seems to me that
    then the "base document type declaration" has already started (which
    might be the case even for alternative 1 above).

        (I don't think it's necessary to give an example of this.)

I find all these unsatisfactory, and I see no example of this in the
standard. Are there other alternatives? How do other people do? What do
HyTime engines expect (exisiting and in development)?

Thanks!
Peter

--
Peter Bergstrom            internet:  pebe@telub.se
Telub Inforum AB           phone:     +46 470 425 97
351 80  VAXJO              fax:       +46 470 397 96
SWEDEN
</message>
<message id="<2qc1je$g4o@search01.news.aol.com>" date="2977171502">
Newsgroups: comp.text.sgml
Date: 06 May 1994 00:05:02 UT
From: Eliot Kimber \<drmacro@aol.com>
Message-ID: <2qc1je$g4o@search01.news.aol.com>
References: <2DC91527@noak.vxo.telub.se>
Subject: Re: HyTime prolog

[Peter Bergstrom]

|   1.  They can be placed as the first thing in the Declaration Subset of the
|       DOCTYPE statement (which I believe is actually part of the base
|       document type declaration):
|
|       \<!DOCTYPE MyDoc SYSTEM "MyDoc.DTD" [
|       \<?HyTime VERSION "ISO/IEC 10744:1992" HYQCNT=32 >
|       \<?HyTime MODULE base>
|       \<?HyTime MODULE locs mixspace multloc spanloc >
|       \<?HyTime MODULE links >
|       \<!-- Here can other declarations, like external entities come. -->
|       ]>

Your answer 1 is the only right answer.  The intent for putting the
declarations in the SGML document proper, and not in the public part of the
DTD or in the SGML declaration itself, is that each instance may have
different requirements, and specifically, because HyTime validation is done
on an instance basis.

Of course, there's no reason these PIs couldn't be put into an external
parameter entity and referred to from within the DTD subset at the proper
point -- it would be trivial to add the declaration and reference to a
document template or something.

I don't know if any current tools enforce this requirement.

--
\<Address HyTime=bibloc>
W. Eliot Kimber (kimber@passage.com) Systems Analyst and HyTime Consultant
Passage Systems, Inc., 9971 Quail Blvd., Suite 903, Austin TX 78758
(512)339-1400
465 Fairchild Dr., Suite 201, Mountain View, CA  94043, (415) 390-0911
\</Address>
</message>
<message id="<1994May6.082807.19224@leeds.ac.uk>" date="2977201687">
Newsgroups: comp.text.sgml
Date: 06 May 1994 08:28:07 UT
From: N F Drakos \<nikos@cbl.leeds.ac.uk>
Message-ID: <1994May6.082807.19224@leeds.ac.uk>
References: \<michael_will-030594121302@m71007.esl.com>
Subject: Re: Single Text, Many Uses (was Re: Real, true uses of SGML)

[Errol Thompson]

|   I beleive that the traditional prose for printed material is not a good
|   approach to implementing online information.  However there are
|   techniques for analysing information that lead to different styles of
|   presentation in both the printed and online media.  One such method is
|   Information Mapping.

Some of the issues in "single text, many uses" are discussed in a paper to
be presented at WWW'94 entitled "From Text to Hypertext: A Post-Hoc
Rationalisation of LaTeX2HTML".  In particular it proposes ways to address
common objections to automatic conversion (in the context of HTML) such as:

       o Word-processing formats contain presentation information that
         cannot be reproduced in HTML

       o Conventional paper-based documents have a linear structure and are
         therefore unsuitable for conversion to the hypertext form

       o It is difficult to determine the level of granularity for each
         hypertext node

       o Some documents require custom navigation aids

       o The conversion process is inflexible and converted documents
         cannot take full advantage of hypermedia capabilities

       o There are subtle differences between electronic and paper-based
         versions and this will compromise quality

This paper is available online as:

    http://cbl.leeds.ac.uk/~nikos/doc/www94/www94.html

This node contains a link to a postscript version.

Also, the node

    http://cbl.leeds.ac.uk/~nikos/doc/www94/section3_5.html#refgat

contains an interactive control panel which can be used to try out the
conversion mechanism on different small examples and see the results.  (You
will need a forms-capable browser to use this - eg a recent version of
Mosaic for X).

Nikos.

--
Nikos Drakos			
Computer Based Learning Unit   	nikos@cbl.leeds.ac.uk
University of Leeds		http://cbl.leeds.ac.uk/nikos/personal.html
</message>
<message id="<CpDsMn.2xt@cwi.nl>" date="2977216943">
Newsgroups: comp.text.sgml
Date: 06 May 1994 12:42:23 UT
From: Frank Teusink \<frankt@cwi.nl>
Organization: CWI, Amsterdam
Message-ID: \<CpDsMn.2xt@cwi.nl>
Subject: Q: Is SGML what I'm looking for?

Hi,

I am looking for a way to transform a relational database of `famous
persons' (result of a hobby of me and a few others) to something that is
more flexible.

Right now, the database contains for each person a number of data-fields
(i.e., name, data-of-birth, place-of-birt, occupation, etc), and a
text-field, plus a number of relations on persons (i.e., parent, marriage,
etc).

It is quite hard to generate readable `articles' from this database;

   - one has to generate a readable paragraph of text from the data-fields,
     and
   - combine it with the text-field to a readable `article'.

Via the HTML language, I arrived at SGML, which might be a solution to my
problem.  Ideally, I'd like to transform the database to something like

    \<person>
    \<h1>\<name>Henry IV\</name> (Bolingbroke),
        \<title>king of England\</title>\</h1>

    Son of \<father>...\</father> and \<mother>...\</mother>.  Born on
    \<date-of-birth>...\</date-of-birth> at ...

    blabla ... blabla ...
    \</person>

Here, `h1' would be a mark-up tag to define the layout of the article
(there could be others too), while the other tags used in the example would
be `attribute-tags'.

However, I would still need some database-functionallity.  Most
importantly, I need to be able to generate indexes on things like \<name>
and \<date-of-birth>, and be able to answer querie to the database.  I have
some other things on my wish-list, but I won't bother you with them righ
now ;-)

Is this possible within the framework of SGML? 
Are there `SGML-databases' that can be used for something like this?
Where are they?

As an aside, I would be grateful for references to a good book on SGML.

Regards,

Frank Teusink
frankt@cwi.nl
</message>
<message id="<TSUCHIYA.94May6132901@maya.sysrap.cs.Fujitsu.Co.JP>" date="2977219741">
Newsgroups: comp.text.sgml
Date: 06 May 1994 13:29:01 UT
From: Satoshi Tsuchiya \<tsuchiya@sysrap.cs.fujitsu.co.jp>
Organization: Sect. 5, Planning Dep., Program Products Group, Fujitsu Ltd.
Message-ID: \<TSUCHIYA.94May6132901@maya.sysrap.cs.Fujitsu.Co.JP>
References: <2q2eoo$7tb@news.csie.nctu.edu.tw>
Subject: Re: SGML->LaTeX->EPS

[Yue-ming Lan]

|   My idea is converting SGML to LaTex and then to EPS. I was told there
|   is an utility does this, where to find it? Can it process double byte
|   character?  (I mean Chinese or Japanese)?  Is there a Japanese version
|   of SGML2Latex converter?

I have modified "sgmls" (free sgml parser written by James Clark) to handle
Microsoft Kanji code (known as Shift-JIS code).  Unfortunately it is not
double byte oriented, but you can convert SGML document which is encoded by
Shift-JIS code to other tagged document according to a conversion
specification.

Speaking roughly, "sgmls" treats a double byte encoded character as two
single byte encoded character internally.  Because applications don't have
to know about internal process of a parser, there is no practical problem.
If you would use Extended UNIX code (EUC) system, "sgmls" can treat
multibyte encoded character.  In case of Microsoft Kanji code system, it
uses ASCII graphic character area as a second byte of double byte code.  So
single byte oriented parser misunderstands some of second byte of double
byte code as a significant SGML character.  For this case, I have modified
"sgmls" source to convert Microsoft Kanji code system to EUC-base code
system at I/O calling point.

If you are interested in my modification, please mail to me.

--
TSUCHIYA Satoshi     | tsuchiya@sysrap.cs.fujitsu.co.jp | NIFTY-ID:GDC02435
$@EZ(J $@20(J  $@E/(J ($@O5<@$N?M(J) | $@IY;N(J$@DL(J Middleware $@;v6HK\\It(J $@4k2hIt(J $@7W2hIt(J
</message>
<message id="<2qe3b6$f7@search01.news.aol.com>" date="2977238822">
Newsgroups: comp.text.sgml
Date: 06 May 1994 18:47:02 UT
From: Jeremy LDI \<jeremyldi@aol.com>
Organization: America Online, Inc. (1-800-827-6364)
Message-ID: <2qe3b6$f7@search01.news.aol.com>
Subject: SGML to LATEX

I would be interested in any work done in converting mathematical text from
SGML to LATEX.  Possible collaboration on developing a filter?
</message>
<message id="<9405062139.AA01735@laser13.cs.uah.edu>" date="2977249195">
Newsgroups: comp.text.sgml
Date: 06 May 1994 21:39:55 UT
From: Lori Snyder \<lsnyder@uahcs2.cs.uah.edu>
Organization: Computer Science Dept., Univ. of Alabama-Huntsville
Message-ID: <9405062139.AA01735@laser13.cs.uah.edu>
Subject: SGML Instance tagged to 87269 spec

Lately there has been a lot of talk about software that is MIL-D-87269
compliant.  There are vendors claiming that their product is FULLY
compliant.  They will demo their product, but they never show what the SGML
instance looks like.  Why is that?

Does anyone know where I could get a SGML instance tagged to the 87269
spec?  The spec itself is very complex and very hard to understand without
an example of how some of the tags are to be applied.  Take for example the
"alert" tag.  There are several attributes, but nothing says what the
attribute values are supposed to mean.  Does the alert tag have to appear
in every frame that it applies to?  How else would you know when the alert
no longer applies?

\</lori>

--
Lori Snyder		lsnyder@cs.auh.edu
</message>
<message id="<CpEu84.B4s@exodus.iti.gov.sg>" date="2977265668">
Newsgroups: comp.text.sgml
Date: 07 May 1994 02:14:28 UT
From: Kay Kit Chan \<kaykit@iti.gov.sg>
Organization: Information Technology Institute, National Computer Board, S'pore
Message-ID: \<CpEu84.B4s@exodus.iti.gov.sg>
Subject: Fax/Email address of Interleaf

Hi,

From a product brochure, I managed to obtain the address and telephone
number of Interleaf as:

           Prospect Place
	   9 Hillside Ave
           Waltham, Massachusetts 02154
           Tel: +1 617 290 0710

but can't seem to find their fax or email address. Can someone help?
Please direct replies to the email address below.

Thanks in advance!

---
Kay-Kit CHAN    			| Internet: kaykit@iti.gov.sg
Information Technology Institute	| Tel:      +1 65 772 0920
National Computer Board of Singapore    | Fax:      +1 65 777 3043
</message>
<message id="<RATH.94May7103935@kippis.igd.fhg.de>" date="2977288774">
Newsgroups: comp.text.sgml
Date: 07 May 1994 08:39:34 UT
From: Hans Holger Rath \<rath@igd.fhg.de>
Organization: Computer Graphics Center, Darmstadt, Germany
Message-ID: \<RATH.94May7103935@kippis.igd.fhg.de>
Subject: [Q] DTD of the chemical industry?


Hi everybody!

A friend of mine asked me to post this article into comp.text.sgml because
he has no news access.

So, please send any answers to him via email. His address is:

	Markus Diermann \<die05@bertelsmann.de>

------------------------

Is there a group that has developed a DTD for the chemical industry?

Any hints are useful!


Thanks,

--Markus

------------------------
</message>
<message id="<MLEISHER.94May7152258@pylos.crl.nmsu.edu>" date="2977312978">
Newsgroups: comp.text.sgml
Date: 07 May 1994 15:22:58 UT
From: Mark Leisher \<mleisher@crl.nmsu.edu>
Organization: Computing Research Lab
Message-ID: \<MLEISHER.94May7152258@pylos.crl.nmsu.edu>
Subject: FOSI info

This is probably a FAQ somewhere, but I couldn't find it: where can I get
info on FOSI other than the files at ftp://wpaftb1.wpafb.af.mil/sgml/?

Being a novice, these files are a little daunting.

--
mleisher@crl.nmsu.edu
Mark Leisher
Computing Research Lab                    "Elegance is not optional!"
New Mexico State University                  -- Attributed to Richard O'Keefe
Box 3CRL, Las Cruces, NM  88003
</message>
<message id="<bkrewin.9.2DCBD64F@cscns.com>" date="2977322192">
Newsgroups: comp.text.sgml
Date: 07 May 1994 17:56:32 UT
From: Bill Krewin \<bkrewin@cscns.com>
Organization: Regulation Scanning
Message-ID: \<bkrewin.9.2DCBD64F@cscns.com>
Keywords: sgml consultant
Subject: consultant needed

Our company publishes information electronically.  We have approx 100,000
documents that are tagged in a proprietary format.

Other companies have expressed strong interest in purchasing this data and
accessing it via various search software.  The search software these other
companies use is typically a combination of full text and hypertext.  We
want to convert from this proprietary format to SGML.  Our experience with
SGML is that we can spell it.

We have a technically competent staff of four programmers and a data
operations department of twenty people.  What we need is an experienced
guide that will assist us in our journey.

Please reply via email if you are interested or can assist us in locating
such an individual.
</message>
<message id="<2DCE5E29@noak.vxo.telub.se>" date="2977396320">
Newsgroups: comp.text.sgml
Date: 08 May 1994 14:32:00 UT
From: Peter Bergstrom \<pebe@telub.se>
Message-ID: <2DCE5E29@noak.vxo.telub.se>
References: <2DC7EB42@noak.vxo.telub.se> <2q8m1v$rdh@ulowell.uml.edu>
Subject: Re: Again: HyTime links in practice

[Lloyd Rutledge]

|   The architectural form "nHyTime" was specified in the HyTime Draft
|   International Standard and was replaced by "sHyTime" in the final
|   version of the standard.  Thus, of course, you should use "sHyTime".

I have used a copy of the International Standard (not the draft) that I
received from the SGML SIG Hyper, through Steven Newcomb, TechnoTeacher.
The reason for me to use this one was that at the time I started to use the
standard, the Swedish Standard Commision had a time scale for the delivery
for about a year and a half (I have no idea of why, it does not usually
take that long).  On my copy it says "First Edition 1992-08-00", which is a
bit cryptic, but it is the true standard, or at least, so say SIG Hyper.
And I've got "nHyTime", not "sHyTime" in my copy.

Also, In the change notes for the Catalog of HyTime Architectural Forms and
HyTime SGML Specification, Version 2.0, June 28, 1993, by Charles F.
Goldfarb, it says:

||                      3. CORRECTIONS IN THIS VERSION
||
||  Although International Standards are subjected to extensive review and||  proofreading, early users frequently discover errors.  These are
||  reported to the Project Editor for eventual consideration by the
||  responsible standards committee, after which "corrigenda" may be
||  published to correct the errors officially.
||
||  Several errors and ambiguities have already been reported; the proposed
||  corrections and clarifications (hereinafter simply "corrections") for
||  some of them are reflected in this document.  While it must be stressed
||  that the corrections are not official (and theoretically might never
||  be), they are of such a nature that they are highly likely to be
||  accepted.  Corrections affecting only the text of the standard (that
||  is, not the formal SGML specifications included in parts 4 and 5 of
||  this document) are preceded with a currency sign ($).
||
||  Corrections added in this version of this Catalog are:
::
||  06.4.2      nHyTime: change to "sHyTime" ("suppress HyTime element");
||              only attributes of nested sHyTime elements are processed.

Therefore, I'm not sure you are right, and I can't see that it is obvious
that I have to use "sHyTime".  I have not yet found any clear reference to
what I'm supposed to use, and that's why I'm interested to know how the
only HyTime engine available handles this.

Peter

--
Peter Bergstr\&ouml;m           internet:  pebe@telub.se
Telub Inforum AB               phone:     +46 - 470 - 425 97
351 80  V\&Auml;XJ\&Ouml;        fax:       +46 - 470 - 397 96
SWEDEN
</message>
<message id="<2DCE5E30@noak.vxo.telub.se>" date="2977396560">
Newsgroups: comp.text.sgml
Date: 08 May 1994 14:36:00 UT
From: Peter Bergstrom \<pebe@telub.se>
Message-ID: <2DCE5E30@noak.vxo.telub.se>
Subject: JPEG and MPEG notations

Can somebody direct me to the correct syntax for the formal public
identifier for the JPEG standard and MPEG standard? I believe they are
currently both ISO-DIS standards, I've got these numbers:

    ISO DIS 10918 (JPEG)
    ISO DIS 11172 (MPEG)

From this I imagine they could be specfied something like...

    \<!NOTATION JPEG PUBLIC
        "ISO 10918//NOTATION Joint Photographers Expert Group//EN">

    \<!NOTATION MPEG PUBLIC
        "ISO 11172//NOTATION Motion Pictures Expert Group//EN">

... but are there any publicly released declaration of these somewhere,
which differ from my guess above?

Thanks!
Peter

--
Peter Bergstr\&ouml;m           internet:  pebe@telub.se
Telub Inforum AB               phone:     +46 - 470 - 425 97
351 80  V\&Auml;XJ\&Ouml;        fax:       +46 - 470 - 397 96
SWEDEN
</message>
<message id="<2DCE5E36@noak.vxo.telub.se>" date="2977397880">
Newsgroups: comp.text.sgml
Date: 08 May 1994 14:58:00 UT
From: Peter Bergstrom \<pebe@telub.se>
Message-ID: <2DCE5E36@noak.vxo.telub.se>
References: <2DC91527@noak.vxo.telub.se> <2qc1je$g4o@search01.news.aol.com>
Subject: Re: HyTime prolog

[Peter Bergstrom]

    1.  They can be placed as the first thing in the Declaration Subset of
|       the DOCTYPE statement (which I believe is actually part of the base
|       document type declaration):
:
|   2.  They can be placed in the SGML Declarations File, after the SGML
|       Declaration, of which APPINFO is the last parameter. This is
|       however a "grey zone" and I have no idea if any parser/HyTime
|       engine will ever see the declarations here.
:
|   3.  They can be the very first thing of the DTD, but it seems to me
|       that then the "base document type declaration" has already started
|       (which might be the case even for alternative 1 above).

[Eliot Kimber]

|   Your answer 1 is the only right answer.  The intent for putting the
|   declarations in the SGML document proper, and not in the public part of
|   the DTD or in the SGML declaration itself, is that each instance may
|   have different requirements, and specifically, because HyTime
|   validation is done on an instance basis.

Thanks Eliot, I will use this until I'm more satisfied with another answer.I still believe that what goes into the document type declaration subset is
part of the base document type declaration, that's at least what I can
figure out from the handbook, page 303 and 403.  What confuses me on page
303 is that "other prolog" is possible before the base document type
declaration, but I can't figure out where to put it, if not immediately
after the SGML declarations (i.e., at the end of the same file = my
original alternative 2).

We're designing a DTD that must not be changed very much by each single
user.  It is not a meta-DTD.  Small changes are allowed, which in practice
means that the user uses a subset of the DTD.  The HyTime support however,
is always required in the same way.  It is very important that the HyTime
links are used for references in the way specified by the DTD.  Thus, all
instances will have the same HyTime support declarations.

Is it then an error to place them as the first thing in the public DTD
(=alternative 3), which is in fact the base DTD?
|   I don't know if any current tools enforce this requirement.

Does anybody else know?

Peter
--
Peter Bergstr\&ouml;m           internet:  pebe@telub.se
Telub Inforum AB               phone:     +46 - 470 - 425 97
351 80  V\&Auml;XJ\&Ouml;        fax:       +46 - 470 - 397 96
SWEDEN
</message>
<message id="<2qjj62$85k@search01.news.aol.com>" date="2977418882">
Newsgroups: comp.text.sgml
Date: 08 May 1994 20:48:02 UT
From: Eliot Kimber \<drmacro@aol.com>
Message-ID: <2qjj62$85k@search01.news.aol.com>
References: <2DC91527@noak.vxo.telub.se> <2qc1je$g4o@search01.news.aol.com>
Subject: Re: HyTime prolog

[Peter Bergstrom]

|   1.  They can be placed as the first thing in the Declaration Subset of
|       the DOCTYPE statement (which I believe is actually part of the
|       base| document type declaration):
|   
|       \<!DOCTYPE MyDoc SYSTEM "MyDoc.DTD" [
|       \<?HyTime VERSION "ISO/IEC 10744:1992" HYQCNT=32 >
|       \<?HyTime MODULE base>
|       \<?HyTime MODULE locs mixspace multloc spanloc >
|       \<?HyTime MODULE links >
|       \<!-- Here can other declarations, like external entities come. -->
|       ]>

[Eliot Kimber]

|   Your answer 1 is the only right answer.  The intent for putting the
|   declarations in the SGML document proper, and not in the public part of
|   the DTD or in the SGML declaration itself, is that each instance may
|   have different requirements, and specifically, because HyTime
|   validation is done on an instance basis.

This is in fact wrong.  I had misremembered the meaning of "prolog",
confusing the proper meaning with "internal DTD subset".  The prolog of an
SGML document is that part between the SGML Declaration (if present) and
the document instance set.  The HyTime standard says the declarations must
occur in the document prolog before the base document type declaration (the
"base document type declaration" is the first one specified (SGML documents
may have multiple DTDs when the CURRENT feature is support)).  Thus, the
HyTime declarations can only go *before* the DOCTYPE declaration, e.g.,:

        \<!SGML ...>
        \<?HyTime VERSION "ISO/IEC 10744:1992" HYQCNT=32 >
        \<?HyTime MODULE base>
        \<?HyTime MODULE locs mixspace multloc spanloc >
        \<?HyTime MODULE links >
        \<!DOCTYPE Foodoc [...]>

Note that an SGML declaration cannot contain processing instructions.

I apologize for any confusion I may have caused.

--
\<Address HyTime=bibloc>
W. Eliot Kimber (kimber@passage.com) Systems Analyst and HyTime Consultant
Passage Systems, Inc., 9971 Quail Blvd., Suite 903, Austin TX 78758
+1 512 339 1400
465 Fairchild Dr., Suite 201, Mountain View, CA  94043, +1 415 390 0911
\</Address>
</message>
<message id="<CpIJv2.HpE@news.Hawaii.Edu>" date="2977438862">
Newsgroups: comp.text.sgml
Date: 09 May 1994 02:21:02 UT
From: Torben Noerup Nielsen \<torben@foralie.net.Hawaii.Edu>
Organization: University of Hawaii
Message-ID: \<CpIJv2.HpE@news.Hawaii.Edu>
Keywords: Equations, AAP
Subject: Documentation

For the last several weeks, I've been hunting around for documentation on a
couple of standards.  But without luck.

I'm looking for documentation for the AAP DTD.  I have heard that this is
now an ISO standard.  Does anyone have an ISO reference number for it?

I also need documentation on ISO standard SGML equations.  Could someone
give me a reference for a document?

The last one is CALS tables.  I'm using the DOCBOOK DTD for some work and
it doesn't contain a whole lot of documentation for the table part.  All
I'm after is some simple documentation on CALS tables so I can get them
displayed using EBT's DynaText.

I'd appreciate any help anyone can give me with this.  The documents do not
have to be free; I'm willing to pay the usual fees for standards documents.
I just need to know who to call.

Thanks, Torben
</message>
<message id="<2DCE63F1@noak.vxo.telub.se>" date="2977455840">
Newsgroups: comp.text.sgml
Date: 09 May 1994 07:04:00 UT
From: Peter Bergstrom \<pebe@telub.se>
Message-ID: <2DCE63F1@noak.vxo.telub.se>
References: <2DC91527@noak.vxo.telub.se> <2qc1je$g4o@search01.news.aol.com> <2qjj62$85k@search01.news.aol.com>
Subject: Re: HyTime prolog

[Eliot Kimber]

|   Thus, the HyTime declarations can only go *before* the DOCTYPE
|   declaration, e.g.,:
|
|           \<!SGML ...>
|           \<?HyTime VERSION "ISO/IEC 10744:1992" HYQCNT=32 >
|           \<?HyTime MODULE base>
|           \<?HyTime MODULE locs mixspace multloc spanloc >
|           \<?HyTime MODULE links >
|           \<!DOCTYPE Foodoc [...]>
|
|   Note that an SGML declaration cannot contain processing instructions.
|
|   I apologize for any confusion I may have caused.

Thanks, Eliot!
I think my answer to your last post is now overruled, but as you can see
there, you did not cause any big confusion, not for me anyway.

So the correct answer is my second alternative, then. Does anybody have any
clue to if the SGML Parsers and HyTime engines out there recognizes these
processing instructions if I put them at the end of my SGML Declarations
file?
</message>
<message id="<0097E29946C99A00.232002B4@dpsl.UUCP>" date="2977458942">
Newsgroups: comp.text.sgml
Date: 09 May 1994 07:55:42 UT
From: Pamela Gennusa \<dpsl!plg@visionware.co.uk>
Message-ID: <0097E29946C99A00.232002B4@dpsl.UUCP>
Subject: LAST CALL for SGML Europe '94 Year-in-Review News Requests

Dear Users of SGML -

A few weeks ago, we ran this "Call for Bits" and we have received a good
response, much better than in past years.  However, the majority of the
responses have been from vendors rather than users.  Now, that's OK.  We
were glad to receive every "BIT", but we know that vendors make a living in
SGML because Users are USING it.

So, let us know what your up to!


C A L L   F O R   B I T S
_________________________

By now a five-year tradition, the SGML Year in Review continues to open
both the North American and European SGML series of conferences.

As always, it attempts to represent exciting new projects and initiatives
as well as provide updates on on-going work of interest to the conference
attendees and to the readers of this newsgroup and the half-dozen
newsletters that now reprint the presentation.


The SGML Europe Conference is fast approaching, and with it, the Year in
Review Call for Contributions.

THIS IS YOUR INVITATION to sing your own praises, toot your own horn, give
the world a paragraph or two (maximum!) to describe:

        1 Relevant Standards Activity	2 Governmental (or EC/CE or other international) Initiatives
	3 Major Industry Initiatives
	4 New (since May '93) and Forthcoming Publications
	5 Major or Especially Interesting SGML Implementations
        6 Product Announcements (since May '93)
	7 Any SGML Activity that fits in the "Miscellaneous" Category

Of particular interest are updates on any activities reported in previous
Year in Review collections.

EVEN IF YOU ARE PLANNING TO SPEAK ABOUT THE RELEVANT ITEM AT SGML EUROPE,
PLEASE ALSO SEND A FEW SENTENCES FOR THE Year in Review.


***NEW THIS SPRING***

For the first time ever, in keeping with the "news broadcast" style of
presentation, the Year in Review will be accompanied by slides.  You are
invited to send relevant images for your item to Pam Gennusa's address
below.  The graphics should be in colour if possible and may depict
individuals, legible computer screen images, or whatever else you think is
appropriate.  They may be funny.  Please don't send colour slides; prints
only.

Vendors of SGML software are also invited to send announcements of software
that has been released since last May.  Please contact Pam or Yuri for more
details.  (We don't want to spoil the surprise for the rest of you.)

As always, thanks in advance.  Please ensure we have the relevant bits by
May 12, 1994.


Email submissions to:
dpsl!plg@visionware.co.uk	AND  yuri@sq.com

Mail graphics to:
Pam Gennusa, DPSL,
Database Publishing Systems Ltd
608 Delta Business Park
Great Western Way
Swindon, Wilts SN1 1LZ
United Kingdom
Ph:  +44 793 512 515
Fax: +44 793 512 516

You may send news items without images.  The images are a bonus.

---------------------------------------------------------------------
Pamela Gennusa				Yuri Rubinsky
Chair, SGML Europe                      Chair, SGML '94
Director,                               President, SoftQuad Inc.
 Database Publishing Systems Ltd

				###
</message>
<message id="<CpJJrs.Jq5@news.cis.umn.edu>" date="2977485357">
Newsgroups: comp.text.sgml
Date: 09 May 1994 15:15:57 UT
From: R Alexander Milowski \<milor001@maroon.tc.umn.edu>
Organization: University of Minnesota
Message-ID: \<CpJJrs.Jq5@news.cis.umn.edu>
References: \<CpDsMn.2xt@cwi.nl>
Subject: Re: Q: Is SGML what I'm looking for?

[Frank Teusink]

|   I am looking for a way to transform a relational database of `famous
|   persons' (result of a hobby of me and a few others) to something that
|   is more flexible.

[ stuff deleted ]

|   Via the HTML language, I arrived at SGML, which might be a solution to
|   my problem. Ideally, I'd like to transform the database to something
|   like
|   
|   \<person>
|   \<h1>\<name>Henry IV\</name> (Bolingbroke), \<title>king of England\</title>\</h1>
|   
|   Son of \<father>...\</father> and \<mother>...\</mother>. Born on
|   \<date-of-birth>...\</date-of-birth> at ...
|   
|   blabla ... blabla ...
|   \</person>
|   

|   Is this possible within the framework of SGML?
|   Are there `SGML-databases' that can be used for something like this?
|   Where are they?

Well, there are a *few* SGML databases but they are quite expensive.  I
might be worth your while, if you don't have the monies, to use a
flat-file/index scenario.

I have built SGML-systems in the past where indexes were used to speed up
the access to SGML documents.  Then, the documents where pulled into a
application.

In fact, this application was similar in the fact that it did not *need*
SGML but SGML made it easier to handle and much easier to print.  I would
think that if you don't have a large number of people in your database you
could use a flat-file/index structure.

One other thing to consider is a translation from some RDB into SGML first.
Then use the SGML for publishing purposes.

|   As an aside, I would be grateful for references to a good book on SGML.

I swear by Goldfarb's SGML Handbook for programmer/analyst types.  For
authors, I couldn't make a recommendation since I'm not an author. ;)

--
R. Alexander Milowski
SGML Researcher
milor001@maroon.tc.umn.edu
</message>
<message id="<2DCED95C@noak.vxo.telub.se>" date="2977485840">
Newsgroups: comp.text.sgml
Date: 09 May 1994 15:24:00 UT
From: Peter Bergstrom \<pebe@telub.se>
Message-ID: <2DCED95C@noak.vxo.telub.se>
Subject: Data attribute troubles

Question regarding data attributes.

My DTD looks as follows:

    \<!NOTATION raster PUBLIC
    "-//USA-DOD//NOTATION CCITT Group 4 Facsimile Type1 Untiled Raster//EN" >
    \<!ATTLIST #NOTATION raster
                        resolution   NUMBER   "300"  -- Unit:DPI -->
    \<!ELEMENT MyDoc - - (#PCDATA)>

My instance looks like this:

    \<!DOCTYPE MyDoc SYSTEM "mydoc1.dtd" [
        \<!ENTITY MyRaster SYSTEM "myraster.ras" NDATA raster >
    ]>
    \<MyDoc>
        Here comes the text of my document.
        \&MyRaster;
    \</MyDoc>

This gives me an error in Omnimark, since I have already used the notation
in an entity declaration, without the parser knowing it has got an
attribute.  I have both OMITTAG and SHORTTAG specified as YES in the SGML
Declarations.  It doesn't help specifying the attribute in the entity
declaration.  Neither does it help if I omit calling the entity in the
text, the error occurs before that.

Now, if I do the following it works:

My DTD looks as follows:

    \<!NOTATION raster PUBLIC
    "-//USA-DOD//NOTATION CCITT Group 4 Facsimile Type1 Untiled Raster//EN" >
    \<!ELEMENT MyDoc - - (#PCDATA)>

My instance looks like this:

    \<!DOCTYPE MyDoc SYSTEM "mydoc2.dtd" [
        \<!ATTLIST #NOTATION raster
                        resolution   NUMBER   "300"  -- Unit:DPI -->
        \<!ENTITY MyRaster SYSTEM "myraster.ras" NDATA raster >
    ]>
    \<MyDoc>
        Here comes the text of my document.
    \</MyDoc>

With the ATTLIST spec coming before the entity declaration, it's perfectly
all right. I can change the resolution etc.

BUT,

I don't want the ATTLIST being part of the document type declaration subset
in this way.  The attribute will always be the same for all instances using
the DTD, and I can't see any reason for having to specify the attlist
first, except that it doesn't work otherwise.

Can I do this in another way?  Am I really forced to put all data
attributes in the declaration subset, like this?  I have not found any real
clues to this from the SGML Handbook.

Thankful for any help.

Peter
--
Peter Bergstr\&ouml;m           internet:  pebe@telub.se
Telub Inforum AB               phone:     +46 - 470 - 425 97
351 80  V\&Auml;XJ\&Ouml;        fax:       +46 - 470 - 397 96
SWEDEN
</message>
<message id="<9405092307.AA1636@notes.notes>" date="2977488288">
Newsgroups: comp.text.sgml
Date: 09 May 1994 16:04:48 UT
From: Ken Holman \<gkholman@msl.isis.org>
Message-ID: <9405092307.AA1636@notes.notes>
References: <2DCE5E30@noak.vxo.telub.se>
Subject: Re: JPEG and MPEG notations

[Peter Bergstrom]

|   are there any publicly released declaration of these somewhere, which
|   differ from my guess above?

Martin Bryan and I began an ad-hoc project within ISO/IEC JTC1/SC18/WG8
about three weeks ago at the latest international meeting in Arlington,
Virginia.

The draft document N1716 dated April 21, 1994 leads with the paragraph:

"The following notation names and public identifiers are designed to make
it easier for users to identify data encoded using ISO and other commonly
used data interchange formats in document sets that reference data through
ISO 9070 public text object identifiers."

We are in the process of collecting suggestions for formal identifiers for
both ISO and proprietary notations.

The document is not _nearly_ complete, nor polished, as we have only been
able to put in a couple hours of work on it during the week of meetings.
Some of the identifiers were found to be incorrect when we returned to our
resources at our own offices (resources that were not available at the
meeting).  However, we _did_ get started and produced enough of a document
to which we could obtain comments.

We were planning a more formal posting to this group to solicit input.....
but thank you Peter for getting the ball rolling sooner.

|   From this I imagine they could be specfied something like...
|
|    \<!NOTATION JPEG PUBLIC
|        "ISO 10918//NOTATION Joint Photographers Expert Group//EN">
|
|    \<!NOTATION MPEG PUBLIC
|        "ISO 11172//NOTATION Motion Pictures Expert Group//EN">

We were thinking of a more algorithmic approach when dealing with standards
documents so that new standards could be addressed in such a way that
people could create new identifiers without waiting for the notation to be
included in a revision of this document.

Our _draft_ idea for JPEG as published in N1716 will be changed to the
following based on information received since the meeting to:

\<!NOTATION JPEG PUBLIC "ISO/IEC 10918-1:1993//NOTATION Digital
                        compression and coding of continuous-tone
                        still images//EN">

Note the use of the title of the standard document as well as the complete
standard identification (with a "-X" part number if applicable).

Regarding MPEG, there are many parts to the standard that specify different
components (audio, video, up to 1.5 mbit/s, etc), so a number of notations
have been proposed.  We are looking for input from SC29 regarding the
correct titles of documents that are associated with specific data types.

Whether this document becomes a TR (Technical Report) or not remains to be
seen.  We have solicited input from member bodies in the resolutions of the
meeting document N1672:

Recommendation 14. Collection of Notation Declarations: WG8 draws the
attention of experts on data encoding to the information being collected in
N1716, "Notation Declarations for data encoding standards".

Our initial collection was gleaned from numerous publicly available and
privately obtained DTDs.  Needless to say the variety of declarations for
the same formats underscored the usefulness for such a report.

Please remember that we are also collecting notations for proprietary data
types, eg:

\<!NOTATION TeX PUBLIC "+//ISBN 0-201-13448-9::Knuth//NOTATION
                      The TeXbook//EN">
\<!NOTATION eps PUBLIC "+//ISBN 0-201-18127-4::Adobe//NOTATION
                      PostScript Language Reference Manual//EN">
\<!NOTATION cdr PUBLIC "-//Corel//NOTATION CorelDraw File Format//EN">

(note that these suggestions have not yet been commented on by their
respective designers/owners)

Any contributions offered will be greatly appreciated.  We will compile a
new list from contributions received.

Contributions are sent to both Martin and myself when mailed to:

            notation@msl.isis.org

Thanks in advance for all your input..................... Ken

--
Martin Bryan
The SGML Centre                  Phone/Fax: +44 452 714029
29 Oldbury Orchard
Churchdown                       E-mail: mtbryan@cheltenham-he.ac.uk
Glos GL3 2PU

G. Ken Holman                 | Phone:  +1 613 727 5696  x317
Vice President, R\&D           | Fax:    +1 613 727 9491
Microstar Software Ltd.       | US:      1 800 267 9975
34 Colonnade Road North       | BBS:    +1 613 727 5272
Nepean Ontario CANADA K2E-7J6 | E-mail: gkholman@msl.isis.org
</message>
<message id=nil date="2977495915">
Newsgroups: comp.text.sgml
Date: 09 May 1994 18:11:55 UT
From: Len Bullard \<bullardc@source.asset.com>
References: \<CpDsMn.2xt@cwi.nl> \<CpJJrs.Jq5@news.cis.umn.edu>
Subject: Re: Q: Is SGML what I'm looking for?

I concur with Milowski on reference to Goldfarb's SGML Handbook.  For
programmers, it is a good book.  For author's it is too much generally.
Practical SGML by Eric Van Herwijnen (spelled from memory) is the best book
I've used.  I also recommend Martin Bryan's SGML An Author's Guide as a
companion.  SoftQuad used to give out a nice brochure-like guide that
explained the basics very well.

I am an author, so that is from Mr Ed's own automated lips.
</message>
<message id="<1994May10.150017.16895@xgml.com>" date="2977570817">
Newsgroups: comp.text.sgml
Date: 10 May 1994 15:00:17 UT
From: Sam Wilmott \<sw@xgml.com>
Organization: Exoterica Corporation
Message-ID: <1994May10.150017.16895@xgml.com>
References: <2DCED95C@noak.vxo.telub.se>
Subject: Re: Data attribute troubles

[Peter Bergstrom]

|   My DTD looks as follows:
|   
|   \<!NOTATION raster PUBLIC
|   "-//USA-DOD//NOTATION CCITT Group 4 Facsimile Type1 Untiled Raster//EN" >
|   \<!ATTLIST #NOTATION raster
|                       resolution   NUMBER   "300"  -- Unit:DPI -->
|   \<!ELEMENT MyDoc - - (#PCDATA)>
|   
|   My instance looks like this:
|   
|   \<!DOCTYPE MyDoc SYSTEM "mydoc1.dtd" [
|       \<!ENTITY MyRaster SYSTEM "myraster.ras" NDATA raster >
|   ]>
|   \<MyDoc>
|       Here comes the text of my document.
|       \&MyRaster;
|   \</MyDoc>
|   
|   This gives me an error in Omnimark, ...

A paragraph of Clause 10.5 of ISO 8879, and the following note state:

   An attempt to redefine an entity is ignored, but is not an error.

   NOTE: This requirement allows an entity declaration in a document type
      declaration subset to take priority over a later declaration of the
      same entity in a public document type definition.

The second text paragraph of Clause 11.1 of ISO 8879, and the following
note state:

   The external identifier points to an entity that is referenced at the
   end of the declaration subset and is considered to be part (or all) of
   it. The effective document type definition is the combination of the
   declarations entered in the subset and the external ones.

   NOTE: A parameter entity declaration in the subset will have priority
      over another declaration for that entity in the external entity, as
      the external entity is parsed later.

The first text paragraph of Clause 11.4 of ISO 8879 states:

   The notation name cannot be specified on another notation declaration in
   the same document type definition.

The first text paragraph of Clause 11.4.1.2 of ISO 8879 (added by Amendment
1) states:

   The validity of a data attribute specification is determined by the
   attribute definition list associated with the data content notation.
   The attribute definition list must have been declared prior to this
   declaration.

As a consequence of 11.1, the declaration(s) in the explicit DOCTYPE are
processed first, and those in "mydoc1.dtd" second.  As a consequence of
11.4.1.2, the ATTLIST for the attributes of the notation used by the entity
"MyRaster" must occur prior to the ENTITY declaration.  Which means that
the ATTLIST cannot occur in "mydoc1.dtd" and the ENTITY declaration in the
DOCTYPE part.  Duplicating the NOTATION declaration in both parts does not
help because of 11.4.

The motivation for the order in which the DOCTYPE part and "mydoc1.dtd" are
processed is provided by 10.5 and the note in 11.1: because the DOCTYPE
part is parsed first, and all but the first ENTITY declaration for an
entity are ignored, ENTITY declarations in the DOCTYPE part can be used to
"parameterize" "mydoc1.dtd", and "mydoc1.dtd" can contain "default" entity
declarations that are used if (prior) ones don't appear in the DOCTYPE
part.  However, this can only be done with entities.  In particular, by
11.4, a second NOTATION declaration is in error.

The solution to the difficulty described in Peter's message is to declare
"mydoc1.dtd" as an entity in the DOCTYPE part and reference it prior to the
ENTITY declaration, rather than referencing the entity from the DOCTYPE
header, as follows:

    \<!DOCTYPE MyDoc [
        \<!ENTITY % mydoc1 SYSTEM "mydoc1.dtd">
        %mydoc1;
        \<!ENTITY MyRaster SYSTEM "myraster.ras" NDATA raster >
    ]>

This forces "mydoc1.dtd" to be processed first and gets the NOTATION and
ATTLIST declarations prior to the ENTITY declarations, where you want them
to be.

If you are having trouble with Omnimark, either as a customer with a
support contract or as a potential customer evaluating our sampler, don't
hesitate to contact our support department by phone (+1 613 722 1700), fax
(+1 613 722 5706) or email (support@exoterica.com).

Regards.
--
Sam Wilmott, Exoterica Corp.
sw@exoterica.com  voice (613) 722-1700 fax (613) 722-5706
Product information: info@exoterica.com
</message>
<message id="<9405101516.AA33052@source.asset.com>" date="2977571792">
Newsgroups: comp.text.sgml
Date: 10 May 1994 15:16:32 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405101516.AA33052@source.asset.com>
References: \<Robbin.9.00109A84@[140.138.150.213]>
Subject: Re: How can I get information about SMSL?

Send email to bmarkey@world.std.com or

    Permanent Wave Productions
    18 Harris Avenue
    Shrewsbury, Mass 01545.

At last accounting, Bryan Markey is the editor and can tell
you about status.
</message>
<message id="<Robbin.9.00109A84@[140.138.150.213]>" date="2977576566">
Newsgroups: comp.text.sgml
Date: 10 May 1994 16:36:06 UT
From: Robbin Wong \<robbin@[140.138.150.213]>
Organization: Dep. Computer Sci. & Information Eng., Chiao Tung Univ., Taiwan, R.O.C
Message-ID: \<Robbin.9.00109A84@[140.138.150.213]>
Subject: How can I get information about SMSL?

How can I get information about SMSL?

The Standard Multimedia Scripting Language (SMSL) is a joint project
between SC18/WG8, the developers of SGML and HyTime, and SC29/WG12, the
Multimedia /Hypermedia Expert Group (MHEG) which started in 1992.

If you know something about the Standard, please reply to me.

Reply will be appreciate and thank you for your reading.

Robbin Wong in Chinese Taiwan.
</message>
<message id="<sdcf8488.007@WordPerfect.com>" date="2977582478">
Newsgroups: comp.text.sgml
Date: 10 May 1994 18:14:38 UT
From: Kerry Burton \<KERRY@WordPerfect.com>
Message-ID: \<sdcf8488.007@WordPerfect.com>
References: <2DCED95C@noak.vxo.telub.se> <1994May10.150017.16895@xgml.com>
Subject: Re: Data attribute troubles

[Peter Bergstrom]

|   My DTD looks as follows:
|
|   \<!NOTATION raster PUBLIC "-//USA-DOD//NOTATION CCITT Group 4
|                       Facsimile Type1 Untiled Raster//EN" >
|   \<!ATTLIST #NOTATION raster
|                       resolution   NUMBER   "300"  -- Unit:DPI -->
|   \<!ELEMENT MyDoc - - (#PCDATA)>
|
|   My instance looks like this:
|
|   \<!DOCTYPE MyDoc SYSTEM "mydoc1.dtd" [
|       \<!ENTITY MyRaster SYSTEM "myraster.ras" NDATA raster >
|   ]>
|   \<MyDoc>
|       Here comes the text of my document.
|       \&MyRaster;
|   \</MyDoc>
|
|   This gives me an error in Omnimark...

[Sam Wilmott]

|   The solution to the difficulty described in Peter's message is to
|   declare "mydoc1.dtd" as an entity in the DOCTYPE part and reference it
|   prior to the ENTITY declaration, rather than referencing the entity
|   from the DOCTYPE header, as follows:
|
|      \<!DOCTYPE MyDoc [
|          \<!ENTITY % mydoc1 SYSTEM "mydoc1.dtd">
|          %mydoc1;
|          \<!ENTITY MyRaster SYSTEM "myraster.ras" NDATA raster >
|      ]>
|
|   This forces "mydoc1.dtd" to be processed first and gets the NOTATION
|   and ATTLIST declarations prior to the ENTITY declarations, where you
|   want them to be.

Omnimark is definitely handling things according to the standard.  I ran
across the same question a year or two ago, and in my opinion this problem
indicates a "hole" in the standard.

Sam's solution works fine for Peter's particular situation, but what will
Peter do when he wants to create "local" entity declarations to override
those in his base DTD?  Does he need to do something like the following?

    \<!DOCTYPE MyDoc [
	\<!ENTITY BaseEnt1 "First Local Entity">
	\<!ENTITY BaseEnt2 "Second Local Entity">
	\<!ENTITY % mydoc1 SYSTEM "mydoc1.dtd">
	%mydoc1;
	\<!ENTITY MyRaster SYSTEM "myraster.ras" NDATA raster >
    ]>

While certainly legal, this seems to go against the "flavor" suggested by
the use of external declaration subsets.  It's not Sam's or Omnimark's
fault; there's no way around it!

For some SGML editors which use precompiled DTDs, this does not work well
at all.  Sometimes the DTD to be precompiled is prohibited from having a
DOCTYPE declaration.  This allows the DTD to be referenced as an external
declaration subset in a generated DOCTYPE declaration when the document is
exported as SGML.  It is my guess that this problem in the standard is the
reason that some of these products have chosen to ignore data attributes
altogether.

The addition of data attributes in Amendment 1 fit in with the original
standard ALMOST seamlessly, but this is one problem which either slipped by
the reviewers or was deemed unresolvable and left that way.

Are there any standard committee members out there who could fill us in on
the reason for the incongruity, or what plans (if any) exist to resolve it?

--
Kerry Burton                  Phone:  (801) 222-7432
WordPerfect Corporation         FAX:  (801) 222-7477
MS E122                       Email:  kerry@wordperfect.com
1555 N. Technology Way          "I speak for myself, not necessarily
Orem, UT  84057                  for WordPerfect (or Novell)"
</message>
<message id="<ROBIN.94May10141053@utafll.utafll.uta.edu>" date="2977589453">
Newsgroups: comp.text.sgml
Date: 10 May 1994 20:10:53 UT
From: Robin Cover \<robin@utafll.uta.edu>
Organization: UT Arlington
Message-ID: \<ROBIN.94May10141053@utafll.utafll.uta.edu>
Subject: New Book on HyTime (DeRose & Durand) 

The new book of HyTime by Steve DeRose and David Durand is introduced in a
file I found on Kluwer's public information disk.  It contains a detailed
Table of Contents listing, Foreword to the book (by Andy van Dam), ordering
information, etc.  Prepublication discount is offered through June 30th.
I'll include below the TOC below, but the full file is available via
anonymous FTP:

               ftp://ftp.std.com/ftp/Kluwer/books/hytime

Robin Cover
(hope Steve and David don't mind this publicity...)
------------------------------------------------------------------------

Making Hypermedia Work: A User's Guide to HyTime
by Steve J. DeRose and David G. Durand

CONTENTS

Contents  i
Foreword  xi
Conventions    xv
Acknowledgments     xvii
PART I
1  Introduction     3
1.1 Purpose and scope of this book 6
1.2 What is HyTime for?  8
1.3 What can HyTime do?  9
1.4 How to use this book 10
2  Hypermedia Concepts   13
2.1 Introduction    13
2.2 Print examples of hyperdocuments    14
2.3 Documents and links  21
2.4 Open systems    27
3  Overview of SGML 35
3.1 What is SGML, really?     35
3.2 Parts of an SGML document 39
3.3 SGML basics     40
3.4 A sample SGML document    51
3.5 Optional SGML features    58
3.6 Formal public identifiers 61
4  HyTime Quick Start    65
4.1 Clink, the basic point-to-point link     65
4.2 Linking to destinations without IDs 67
4.3 HyTime locators 68
4.4 Graphic locations and geometry 70
4.5 Linking by searching 71
4.6 More complex link structures   72
4.7 HyTime and SGML element types  74
5  The Structure of HyTime    77
5.1 Basic HyTime concepts     77
5.2 Architectural forms  79
5.3 SGML issues in HyTime     90
5.4 HyTime's hyperdocument model   92
5.5 How a HyTime system fits together   94
5.6 HyTime declarations  97
5.7 The HyTime modules   99
PART II
6  Basic Measurement     103
6.1 Introduction to measurement    103
6.2 Geometric concepts   104
6.3 Architectural forms  105
7  Hyperlinks  109
7.1 Introduction    109
7.2 Basic Concepts  109
7.3 Architectural forms  114
7.4 Hyperlinks and webs  118
8  Locating Data Objects 121
8.1 Introduction    121
8.2 Naming, counting and querying  121
8.3 Named location addresses  123
8.4 Coordinate location addresses  131
8.5 Semantic locations   145
8.6 Creating robust pointers  151
8.7 Aggregates and spans 153
9  The HyTime Query Language  159
9.1 Purpose and basic capabilities 159
9.2 An example 160
9.3 HyQ syntax 162
9.4 The HyQ functions    163
9.5 HyQ and other query languages  186
10  Modifying a DTD for HyTime     189
10.1 Introduction   189
10.2 HTML and the World Wide Web   190
10.3 The DTD conversion  195
10.4 Evaluating a HyTime product   202
10.5 Summary   203
PART III
11  Lexical Types and Properties   207
11.1 Introduction   207
11.2 Introduction to lexical types 208
11.3 HyTime lextype type constraints    210
11.4 Defining lexical models  210
11.5 HyLex     213
11.6 Accessing public lexical types     220
11.7 HyTime properties   222
11.8 Property definition 227
11.9 Inherent property definitions 230
12  Extending HyTime     237
12.1 Introduction   237
12.2 HyQ extensions 238
12.3 Lexical types  240
12.4 Properties     241
12.5 Specific elements for media links  241
12.6 A protocol for hot spots on graphs 246
12.7 TEI  248
13  Advanced Measurement and Scheduling 253
13.1 Introduction   253
13.2 Units of measurement     254
13.3 Axes and finite coordinate spaces  257
13.4 Events and extents  260
13.5 Representing tables 267
13.6 Marker functions    271
13.7 Other features 273
14  The HyTime Base Module    275
14.1 Introduction   275
14.2 Entity trees   276
14.3 Base module option summaries  276
14.4 The all-id attribute form     280
14.5 The all-lex attribute form    284
14.6 The lexmodel architectural form    285
14.7 The lexord architectural form 285
14.8 The all-ref attribute form    287
14.9 The all-act    292
14.10 The any-dcn   293
14.11 The all-qual  294
APPENDIXES
Appendix A  HyTime Meta-DTD   295
A.1 Base module (clause 6)    296
A.2 Measurement (clause 7)    301
A.3 Location address (clause 8)    305
A.4 Hyperlinks (clause 9)     309
A.5 Scheduling (clause 10)    310
A.6 Rendition (clause 11)
A.7 Useful types and notations     317
Measurement (annex A.5)  318
Appendix B  Graphics Notations     321
Appendix C  HyTime Reference Material   327
Bibliography   333
Glossary  341
Index     359

FOREWORD

HyTime is the first official standard for describing the structure of
time-based hypermedia documents. Its marketplace acceptance is yet to come.
This book, presenting the first in-depth guide to the HyTime specification,
both describes its key features and provides guidelines on how it is used.
It has been written by two leading experts in the field who have had
significant impact on its development and are experts in SGML, the prior
standard that HyTime extends.
[... continues]
</message>
<message id="<2qp73j$mbi@lovecraft.convex.com>" date="2977603123">
Newsgroups: comp.infosystems.www,comp.text.sgml,comp.lang.perl
Date: 10 May 1994 23:58:43 UT
From: Earl Hood \<ehood@convex.com>
Organization: Engineering, Convex Computer Corporation, Richardson, Tx USA
Message-ID: <2qp73j$mbi@lovecraft.convex.com>
Keywords: SGML, HTML, Web, Perl
Summary: Hypertext navigation of SGML DTDs
Subject: SGML DTD navigation with the Web

(This message is being sent out to what I believe are interested parties.)

Ever wondering if you can put a paragraph tag in a heading?  What elements
can contain a footnote?  Can I put a graphic inside of a table?

If you've asked yourself questions like above, then I have the program for
you: dtd2html.

Dtd2html is a Perl program that generates an HTML document (composed of
several files) to allow hypertext navigation through an SGML DTD.  Now one
can easily find out what elements can contain what.  Dtd2html determines
what are the top-most elements in a DTD, and then allows a user to navigate
down (or up) the DTD element structure via hyperlinks.

Dtd2html also generates an HTML page containing a list of all elements so
you can quickly jump to an element page you want.  Dtd2html can also
generate a page so one can see the overall element hierarchy defined by the
DTD.

Here's an example of what an element page looks like from the HTML DTD
(This was generated thru Mosaic's save file option using formatted text;
therefore, all the hyperlinks do not show up):

---------------------------------------------------------------------------

HEAD
****

The HEAD element contains all information about the document in general.
It does not contain any text which is part of the document: this is in the
BODY.

Content
=======

 o base
 o isindex
 o link
 o nextid
 o title

Tag Minimization
   Open Tag: REQUIRED
   Close Tag: REQUIRED

See ATTRIBUTES
See CONTENT DECLERATION

Parent Elements
===============

 o html

Goto top element list.
Goto all element list.
Goto tree.

Goto HTML DTD main page.

HTML DTD

---------------------------------------------------------------------------

The element page contains hyperlinks to other element pages; a page
describing the attributes of the element; a page giving the actual SGML
syntax that defines the element; and links to the top element list, all
element list, tree, and home pages.

I've currently used dtd2html on the following DTDs:

	DocBook, OpenBook, Rainbow, HTML, HTML+, OSF Book

I've added in the ability for dtd2html to add in descriptions of the
elements in the DTD.  This way, one can use dtd2html as way of documenting
a DTD.

Is anyone interested in such a program?  I've created a tar file containing
the program and the DTDs I've converted.  The tar file also contains the
following:

    dtdtree
	dtdtree outputs the content hierarchy tree (in ASCII) of SGML
	elements defined in a DTD.

    dtd.pl
	A Perl library that parses an SGML document type defintion (DTD)
	and builds up data structures containing the structural content of
	the DTD.  Various convienance routines are defined to access the
	data structures.

	dtd2html and dtdtree utilize dtd.pl to do their work.

If there is interest, I would like to see if anyone/site is willing to
archive the stuff for distribution.  Currently, our company is still
working on geting a public Web server running, so it may be awhile before I
can put it there.

Following is the text file describing dtd2html:


dtd2html
*********

dtd2html is a Perl program that generates an HTML document (composed of
several files) to allow hypertext navigation through an SGML DTD.



Description
============

dtd2html generates various HTML files for hypertext navigation of a SGML
DTD.  The files generated are as follows:

DTD-HOME.html
   This file is the home page of the HTML document.  This file contains to
   the basic links to start navigating through the DTD.  The name of this
   file can be changed with the -homename option.

TOP-ELEM.html
   This file lists the top-most elements of the DTD, and contains the links
   to element pages describing each top-most element.  The name of this
   file can be changed with the -topname option.

ALL-ELEM.html
   This file contains a list of all elements defined in the DTD.  This page
   allows quick access to any individual element description page.  The
   name of this file can be changed with the -allname option.

DTD-TREE.html
   (Optional) This file contains the content heierachy tree(s) of the
   top-most element(s) in the DTD.  This file is only generated if the
   -tree option is specified during program invocation.  The name of this
   file can be changed with the -treename option.

element.html
   For each element defined in the DTD, an element description file is
   generated with a filename of the element name suffixed by ".html".

element.attr.html
   For each element defined in the DTD, a file is generated describing the
   attributes defined for the element.

element.cont.html
   For each element defined in the DTD, a file is generated listing the
   content model decleration of the element as declared in the DTD.

Once all the files are generated, one needs only to create a link in the
Web server being used to the DTD-HOME.html file.

More information on the content of each file is in the HTML File
Descriptions section.



Usage
======

dtd2html is invoked from a Unix shell, with the following syntax:

% dtd2html [options] filename

filename is the SGML DTD to be parsed for generating the HTML files.  The
following is the list of options available:

-allname filename
   Set the filename for file listing all elements in the DTD to filename.
   The default name is "ALL-ELEM.html".

-descfile filename
   Use filename as the source for element descriptions in the DTD.  If this
   argument is not specified, no description file is used.  See Element
   Description File for more information.

-dtdname string
   Set the name of the DTD to string.  If not specified, dtd2html
   determines the name of the DTD by its filename with the extension
   stripped off.  If reading from standard input, then this argument should
   be specified.  Otherwise, "Unknown" is used.

   The string " DTD" will be appended to the name of the DTD.

-elemlist
   Generate a blank element description file (see Element Description
   File).  No HTML files are generated and all other options are ignored
   when this option is specified.

-help
   Print out a terse description of all options available.  No HTML files
   are generated and all other options are ignored when this option is
   specified.

-homename filename
   Set the filename for the HTML home page for the DTD to filename.  The
   default name is "DTD-HOME.html".

-level #
   Set the prune level of the content hierachy tree to #.  This option is
   only valid if -tree is specified.

-mapfile filename
   Set the entity to map file to filename.  The default is "map.txt".  See
   DTDread_mapfile of package dtd for more information.

   The entity map file is only necessary if element/attribute markup are
   contained in multiply files, and the files are referenced by external
   parameter entities.  dtd2html ignores general entities since they do not
   define document structure.

-outdir path
   Set destination of generated HTML files to path.  Defaults to the
   current working directory.

-topname filename
   Set the filename for file listing the top-most elements in the DTD to
   filename.  The default name is "TOP-ELEM.html".

-tree
   Generate the content hierarchy of the top-most elements defined in the
   DTD.

-treename filename
   Set the filename for file containing the content hierarchy tree(s) of
   the DTD to filename.  The default name is "DTD-TREE.html".  This option
   is only valid if -tree is specified.

-treetop string
   Set the top-most elements to string.  String is a comma separated list
   of elements that dtd2html should treat as the top-most elements when
   printing the content hierarchy tree(s).  This option is only valid if
   -tree is specified.

   Normally, dtd2html will compute what are the top-most elements of the
   DTD.  This option overrides that computation for the sake of of printing
   the content hierarchy trees.  This option has no affect on the top-most
   element list presented "TOP-ELEM.html" (or the file specified by
   -topname).


HTML File Descriptions
=======================

All HTML files/pages generated contain hypertext links at the end of the
page to the DTD-HOME, TOP-ELEM, ALL-ELEM, and DTD-TREE (if specified)
files, unless stated otherwise.

DTD-HOME
+++++++++

This page is the root of the HTML document.  It contains the links to the
other main pages as described above.  It also contains a brief paragraph
mentioning that this document was created by dtd2html.

This file may (should) be editted to add further description of the DTD,
and/or contain links to other documents about the DTD.

TOP-ELEM
+++++++++

This page contains the list of all top-most elements defined in the DTD.  A
top-most element is defined as: An element which cannot be contained by
another element or can be only contained by itself.

ALL-ELEM
+++++++++

This page contains a list of all elements defined in the DTD.

DTD-TREE
+++++++++

This page contains the content hierarchy tree(s) of the top-most elements
of the DTD.  See DTDprint_tree of package dtd for a detailed description
about the content hierarchy tree(s).

element
++++++++

element.html is the file describing the content of element.  The element
page is divided into the following sections:

 o element description (optional, see Element Description File).
 o Links to subelements broken into three parts:
    o Base content
    o Inclusions (if defined)
    o Exclusions (if defined)
 o Tag minimization (if defined).
 o Links to element's attribute (element.attr) and SGML decleration
   (element.cont) pages.
 o Links to all possible parent elements.
 o Links to main pages as described above.

element.attr
+++++++++++++

element.attr.html is the file describing the attributes of element.  The
element.attr page is divided into the following sections:

 o attribute(s) description (optional, see Element Description File).
 o List of attributes with possible values and default value.
 o Link back to element page.

element.cont
+++++++++++++

element.cont.html is the file giving the element content model decleration
as it appears in the DTD.  The element.cont page is divided into the
following sections:

 o Base content decleration.
 o Inclusion content decleration (if defined).
 o Exclusion content decleration (if defined).
 o Link back to element page.



Element Description File
=========================

When dtd2html is invoked, a file containing descriptions of the various
elements defined in the DTD may be specified (-descfile).  These
descriptions will appear in the element.html and element.attr.html files.
The syntax of the description file is as follows:

         \<?DTD2HTML element1>
         Description of element1 here.
         \<P>
         \<?DTD2HTML element2>
         Description of element2 here.
         \<P>
         ...

The line \<?DTD2HTML element> signifies the beginning of the element
description text.  All text up to the next \<?DTD2HTML element> line or
end-of-file is used as the element description.

If element in the description file is suffixed by a "*", then the following
description text will put in the element's attribute page.  Otherwise, the
description text appears in the element page.

The description text can be any valid HTML markup.  It is recommended to
end each description with the \<P> tag so the description is separated from
any other HTML markup generated by dtd2html.  An \<HR> (horizontal rule) tag
is put after the description text by dtd2html.

To get started with a description file for a DTD, one can use the -elemlist
option to dtd2html to generate a file with all elements defined in the DTD
with empty descriptions.



See Also
=========

dtdtree
   Generate content hierarchy trees of SGML elements.

dtd.pl
   A Perl library to parse SGML DTDs.


Earl Hood, ehood@convex.com
--
    Earl Hood                |   CONVEX Computer Corporation
    ehood@convex.com         |   3000 Waterview Parkway
    Phone: (214) 497-4387    |   P.O. Box 833851
    FAX: (214) 497-4500      |   Richardson, TX  75083-3851
</message>
<message id="<9405111702.AA0524@notes.notes>" date="2977639120">
Newsgroups: comp.text.sgml
Date: 11 May 1994 09:58:40 UT
From: Matt Timmermans \<mtimmerm@msl.isis.org>
Message-ID: <9405111702.AA0524@notes.notes>
References: <2DCED95C@noak.vxo.telub.se> <1994May10.150017.16895@xgml.com> \<sdcf8488.007@WordPerfect.com>
Subject: Re: Data attribute troubles

[Kerry Burton]

|   \<!DOCTYPE MyDoc [
|       \<!ENTITY BaseEnt1 "First Local Entity">
|       \<!ENTITY BaseEnt2 "Second Local Entity">
|       \<!ENTITY % mydoc1 SYSTEM "mydoc1.dtd">
|       %mydoc1;
|       \<!ENTITY MyRaster SYSTEM "myraster.ras" NDATA raster >
|   ]>
|
|   While certainly legal, this seems to go against the "flavor" suggested
|   by the use of external declaration subsets.  It's not Sam's or
|   Omnimark's fault; there's no way around it!

This gives me an opportunity to flog my favourite dead horse in a somewhat
more eloquent way: The "flavor" suggested by the use of external
declaration subsets is wrong.

There is no difference between:

    \<!DOCTYPE cow SYSTEM "cow.dtd"
    [
    ... other declarations ...
    ]>

and
    \<!DOCTYPE cow
    [
    \<!ENTITY % thedtd SYSTEM "cow.dtd">
    ... other declarations ...
    %thedtd;
    ]>

The first example _seems_ to imply that the document somehow conforms to
cow.dtd, but this is not the case.  There is no way in SGML to declare
conformance to a publicly known content model.  This does not indicate a
hole in the Standard.  It indicates, rather, a false inferrence made by
_everybody_ (myself included) who undertakes to learn SGML.

\</Matt>

--
Matt Timmermans               | Phone:  +1 613 727 5696
Microstar Software Ltd.       | Fax:    +1 613 727 9491
34 Colonnade Rd. North        | BBS:    +1 613 727 5272
Nepean Ontario CANADA K2E-7J6 | E-mail: mtimmerm@msl.isis.org
</message>
<message id="<2qqqcg$ptm@rs18.hrz.th-darmstadt.de>" date="2977655632">
Newsgroups: comp.infosystems.www,comp.text.sgml,comp.lang.perl
Date: 11 May 1994 14:33:52 UT
From: Joachim Schrod \<schrod@iti.informatik.th-darmstadt.de>
Organization: TH Darmstadt, FG Systemprogrammierung
Message-ID: <2qqqcg$ptm@rs18.hrz.th-darmstadt.de>
References: <2qp73j$mbi@lovecraft.convex.com>
Keywords: SGML, HTML, Web, Perl
Subject: Re: SGML DTD navigation with the Web

[Earl Hood]

|   Ever wondering if you can put a paragraph tag in a heading?  What
|   elements can contain a footnote?  Can I put a graphic inside of a
|   table?
|
|   If you've asked yourself questions like above, then I have the program
|   for you: dtd2html.

And if you would tell us an ftp site, we would be most happy... :-)

Cheers,
	Joachim

--
Joachim Schrod			Email: schrod@iti.informatik.th-darmstadt.de
Computer Science Department
Technical University of Darmstadt, Germany
</message>
<message id="<9405111956.AA18658@wpdis03.wpafb.af.mil>" date="2977674954">
Newsgroups: comp.text.sgml
Date: 11 May 1994 19:55:54 UT
From: John Knauer \<knauerj@wpdis03.wpafb.af.mil>
Message-ID: <9405111956.AA18658@wpdis03.wpafb.af.mil>
References: \<CpIJv2.HpE@news.Hawaii.Edu>
Keywords: Equations, AAP
Subject: Re: Documentation

[Torben Noerup Nielsen]

|   For the last several weeks, I've been hunting around for documentation
|   on a couple of standards.  But without luck.
|
|   I'm looking for documentation for the AAP DTD.  I have heard that this
|   is now an ISO standard.  Does anyone have an ISO reference number for
|   it?
|
|   I also need documentation on ISO standard SGML equations.  Could
|   someone give me a reference for a document?
|
|   The last one is CALS tables.  I'm using the DOCBOOK DTD for some work
|   and it doesn't contain a whole lot of documentation for the table part.
|   All I'm after is some simple documentation on CALS tables so I can get
|   them displayed using EBT's DynaText.
|
|   I'd appreciate any help anyone can give me with this.  The documents do
|   not have to be free; I'm willing to pay the usual fees for standards
|   documents.  I just need to know who to call.

In regards to your request for information regarding CALS tables, there is
a file available via anonymous ftp which documents the table model in
MIL-M-38784D.  This specification is currently out for official
coordination.  Currently it is anticipated that this will be officially
released around September.

There were a number of improvements made in the table model in the current
version of MIL-M-38784C AMENDMENT 3.  The changes and the reasons for them
are also contained in this file.  This should help you understand the
documentation for MIL-M-38784D as it applies to the DTD you are using.

Whenever I see the phrase CALS tables I get a bit nervous because what is
usually being referred to is the table model of Appendix A in MIL-M-28001B.
This DTD has a number of problems.  It was placed there only for
illustration purposes and is duly noted as such, but still has been
interpreted by many to be the CALS DTD.  This interpretation is incorrect.
This DTD is not being used by Air Force in any official capacity.

To access this file login to WPAFTB1.WPAFB.AF.MIL as FTP and then type cd
sgml.  The file is table.doc.  There are also a number of files relating to
the Air Force CALS effort in this subdirectory.  The file filelist.txt
contains a listing of all the files in this subdirectory.

--
John Knauer		     knauerj@wpdis03.wpafb.af.mil
RJO Enterprises AFMC\\ENCS
</message>
<message id="<CpnMv8.6AG@world.std.com>" date="2977676035">
Newsgroups: comp.text.sgml
Date: 11 May 1994 20:13:55 UT
From: William L Trippe \<trippe@world.std.com>
Organization: The World Public Access UNIX, Brookline, MA
Message-ID: \<CpnMv8.6AG@world.std.com>
Keywords: programming
Summary: Software programmers wanted in Boston
Subject: Programming Positions in Boston

    WANTED: Senior Software Development Engineer
    WANTED: Software Development Engineer

            Boston, MA.  Will Not Relocate.

NO E-MAIL REPLIES, PLEASE!!

InfoSoft International Inc. is a rapidly growing software developer that
recently went public.  (InfoSoft was formerly the Software Division of
Houghton Mifflin Company.)  Our present team of 100 professionals is
predominantly software engineers and linguists.  InfoSoft International has
established a line of multilingual tools and reference works that are
licensed to software developers and hardware manufacturers worldwide.

Our platform-independent linguistic software products include: a spelling
checker and corrector that works in 18 languages and dialects, a grammar
correction system for American and British English, and a database
retrieval system that can access electronic works such as dictionaries,
thesauruses, and encyclopedias in many languages.  We are also developing a
new line of information management tools.

All products are licensed to software OEMs who incorporate InfoSoft's
software within their applications.  Products are licensed as C source code
and as such need to be portable to a wide range of computer platforms and
operating systems including: MS-DOS, OS/2, MS Windows, Windows NT,
Macintosh, VAX/VMS, and UNIX.


The IntelliFinder is a research and development effort that is focused on
information storage and retrieval technology.  We work primarily with
highly structured static reference works in many languages.


===========================================================================

Senior Software Development Engineer:

This position requires a forward thinking senior engineer with a focus on
design and software architecture.  The engineer must have experience with
an entire product cycle, especially the design and specification stages.
The engineer should have a demonstrated ability to document ideas and
concepts.  If possible we would like to see previous design documents.
Previous experience with project leadership is desirable.

The candidate should be strong in C, should have 7 or more years experience
developing software, and should have recent experience developing for
personal computers (PCs or Mac).  Experience with cross platform
development, data compression and multimedia are desirable.  Knowledge of
document mark-up and parsing (SGML) is also desirable.  Knowledge of
foreign languages is a plus.

Initial tasks include specification of many of the new sub-systems.  The
engineer will help the project leader in reviewing all specification work
done by other sources, and will be expected to review and contribute to all
designs.  The position will require code development and code review (for
standards and performance).

===========================================================================

Software Development Engineer:

This is a junior coding and development position.  The candidate should
have strengths in coding new systems to specification.  Also needed are
skills in performance analysis, and integration of code.  Experience with
the C programming language in as many environments as possible is a plus.
Experience with Windows NT, UNIX, or Macintosh or OS/2 is desired in
addition to experience with the Windows development environment.

Demonstrated coding ability and the ability to learn new concepts
quickly is needed.  Knowledge of foreign languages is a plus.


===========================================================================

Summary:

InfoSoft International Inc. offers an excellent opportunity for computer
professionals to gain valuable experience in software development.  We
develop products for the latest computers, operating systems, and
compilers, offering a unique opportunity to get hands-on experience with
the newest development tools and systems.

The pace of the group is quick and the work will be challenging.  However,
the opportunity for acquiring valuable experience in the software industry
is tremendous and the work will be very rewarding.


Please send or fax resume and salary requirement to:

Pat Lewis
InfoSoft International Inc.
222 Berkeley Street
Boston, MA 02116
Fax #: 617-351-1115

NO E-MAIL REPLIES, PLEASE!!
</message>
<message id="<2qrf5t$5ig@panix.com>" date="2977676925">
Newsgroups: comp.text.sgml
Date: 11 May 1994 20:28:45 UT
From: Henry Chung \<hchung@panix.com>
Organization: PANIX Public Access Internet and Unix, NYC
Message-ID: <2qrf5t$5ig@panix.com>
Keywords: SGML parser
Subject: [Q] Whereabouts of a public domain parser?

I am trying to locate the public domain parser.  I had downloaded it once a
while ago but have long since deleted it off my system.  Recent archie
searches have turned up nothing.  Does anyone know where I might find it?
Any help would be much appreciated.  Thanks.

--
Henry Chung (hchung@panix.com)
Technical Director
US Lynx
</message>
<message id="<2qrkjm$b0h@lovecraft.convex.com>" date="2977682486">
Newsgroups: comp.infosystems.www,comp.text.sgml,comp.lang.perl
Date: 11 May 1994 22:01:26 UT
From: Earl Hood \<ehood@convex.com>
Organization: Engineering, Convex Computer Corporation, Richardson, Tx USA
Message-ID: <2qrkjm$b0h@lovecraft.convex.com>
Summary: ftp sites for dtd2html
Subject: DTD2HTML NOW AVAILABLE (was Re: SGML DTD navigation ...)

dtd2html is now available (or will be soon) at the following locations:

    ftp://ftp.law.indiana.edu/pub/www/
    ftp://ftp.funet.fi/pub/networking/services/www/misc/
    http://caesar.cs.uiowa.edu/~jollie/dtd2html/
    ftp://ftp.ifi.uio.no//pub/SGML/DTD2HTML/

    and at ftp.utdallas.edu once a Web page is written for it.

More may get added over time.  I'd like to setup a standard mechanism for
providing later versions (if any) to published.

There are 2 files in the distribution:

    dtd2html.tar
	Tar file containing the distribution of dtd2html and
	other related tools and files.

    dtd2html.examples.tar
	Tar file of DTDs processed with dtd2html, and sample
	output from dtdtree.


Thanks to those willing to archive the files.  All locations listed above
are subject to change without notice.

If there are any bug/comments/suggestions about dtd2html and related tools,
please drop me a message.

	--ewh
--
    Earl Hood                |   CONVEX Computer Corporation
    ehood@convex.com         |   3000 Waterview Parkway
    Phone: (214) 497-4387    |   P.O. Box 833851
    FAX: (214) 497-4500      |   Richardson, TX  75083-3851
</message>
<message id="<9405121304.AA16979@source.asset.com>" date="2977736668">
Newsgroups: comp.text.sgml
Date: 12 May 1994 13:04:28 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405121304.AA16979@source.asset.com>
References: \<CpIJv2.HpE@news.Hawaii.Edu> <9405111956.AA18658@wpdis03.wpafb.af.mil>
Subject: Re: Documentation

Isn't MIL-M-28001B the DTD that is being quoted for contracting purposes?
If there is confusion, it is widespread and spreading faster.  I've
received several phone calls lately in which I'm asked about tools to
support production using the "CALS DTD per MIL-M-28001".  If this is not
the DTD to which they are referring, a very strong message should come out
of the CALS Policy Office, the ISG EPC or someone.

Does the Air Force provide or recommend a FOSI for the 38784 DTD?

Len Bullard
Unisys Huntsville
</message>
<message id="<9405121435.AA02475@tecnet1.jcte.jcs.mil>" date="2977742114">
Newsgroups: comp.text.sgml
Date: 12 May 1994 14:35:14 UT
From: Debby Young \<young@tecnet1.jcte.jcs.mil>
Message-ID: <9405121435.AA02475@tecnet1.jcte.jcs.mil>
References: \<CpIJv2.HpE@news.Hawaii.Edu> <9405111956.AA18658@wpdis03.wpafb.af.mil> <9405121304.AA16979@source.asset.com>
Subject: Re: Documentation

[Len Bullard]

|   Isn't MIL-M-28001B the DTD that is being quoted for contracting purposes?

I don't believe that the 38784 DTD that was attached to 28001B is the only
DTD being quoted for contracting purposes.  I know that the air side of the
Navy uses another format, work package, for which a DTD and FOSI is being
finalized.

|   If there is confusion, it is widespread and spreading faster.  I've
|   received several phone calls lately in which I'm asked about tools to
|   support production using the "CALS DTD per MIL-M-28001".  If this is
|   not the DTD to which they are referring, a very strong message should
|   come out of the CALS Policy Office, the ISG EPC or someone.

From what I have gathered in my travels, the DTD attached 38784 was never
functionally tested to work, and the JCALS folks proved that it would not
support their requirements.  There is going to be a revision coming to
28001; I hear that because of JCALS findings that the priority to get the
standard updated has been elevated.

|   Does the Air Force provide or recommend a FOSI for the 38784 DTD?

I don't know about the Air Force providing a FOSI (I haven't found one yet)
but the Naval Air community will be finalizing the FOSI for 38784C for our
use by the end of June.  The Naval Air Electronic TM Team has also produced
an IDEF model which defines the management process for DTDs and FOSIs,
including configuration control, improving, and testing.  When the FOSI is
finalized, we will place it in the public domain.

--
Debby Young
Naval Air Warfare Center
Weapons Divsion
Point Mugu, CA
young@tecnet1.jcte.jcs.mil
(805) 484-6576
</message>
<message id="<9405121457.AA48131@source.asset.com>" date="2977743478">
Newsgroups: comp.text.sgml
Date: 12 May 1994 14:57:58 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405121457.AA48131@source.asset.com>
Subject: Treeloc in HyTime

In the example given in ISO10744, the marklist is used to locate third
element of the second element in the second level:

\<marklist>1  2  3\</marklist>

where \<book> contains \<body> and \<annex>.  \<book> is root (1) and \<annex>
is the second element of the second level (2).  \<annex> has three \<figure>
elements and the example locates the third \<figure> element (3).  This is a
classical indentured list in which all elements are numbered.  So far so
good.

What is the correct coding for a recursive element declared with a content
model of elements that might also be recursive?  For example:

\<!ELEMENT techinfo -- (system)+ >
\<!ELEMENT system - - (system*, descinfo*, taskinfo*, partinfo*, faultinf*)>
</message>
<message id="<9405121509.AA48381@source.asset.com>" date="2977744155">
Newsgroups: comp.text.sgml
Date: 12 May 1994 15:09:15 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405121509.AA48381@source.asset.com>
Subject: Treeloc in HyTime (continued)

To continue,

Given the system self-similar space, the following instance is:

\<techinfo>
   \<system>
      \<system>
         \<descinfo>...\</descinfo>
         \<descinfo>...\</descinfo>
         \<partinfo>...\</partinfo>
      \</system>
   \</system>
\</techinfo>

What is the marklist that locates the \<partinfo> element?

Len Bullard
</message>
<message id="<9405121550.AA15032@source.asset.com>" date="2977746631">
Newsgroups: comp.text.sgml
Date: 12 May 1994 15:50:31 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405121550.AA15032@source.asset.com>
References: \<CpIJv2.HpE@news.Hawaii.Edu> <9405111956.AA18658@wpdis03.wpafb.af.mil> <9405121304.AA16979@source.asset.com> <9405121435.AA02475@tecnet1.jcte.jcs.mil>
Subject: Re: Documentation

Thanks, Debby.  What is the prescriptive scope your DTD?  Print, digital,
or both?  Will it be applied to IETMs?

Len
</message>
<message id="<24916@dog.cme.nist.gov>" date="2977749509">
Newsgroups: comp.text.sgml
Date: 12 May 1994 16:38:29 UT
From: Lisa Phillips \<phillips@cme.nist.gov>
Organization: National Institute of Standards & Technology, Gaithersburg, MD
Message-ID: <24916@dog.cme.nist.gov>
Subject: nesting entities

I previously sent a message regarding nesting entities and got a lot of
feedback from comp.text.sgml readers, all of whom basically told me that it
was in fact possible to nest entities in SGML.  Thanks for that feedback.
I did try my code with another parser and it turned out that it was,
indeed, an error in the original software that I was using and not a
limitation of SGML.

The question that follows is this: do you model your DTD according to the
software that your're using (even if there's a bug -- or as the vendor says
a "lack of a feature" in it).  What good is a DTD if you're software
doesn't understand it?!

L. Phillips
</message>
<message id="<2qtrn0$n07@lovecraft.convex.com>" date="2977755296">
Newsgroups: comp.infosystems.www,comp.text.sgml,comp.lang.perl
Date: 12 May 1994 18:14:56 UT
From: Earl Hood \<ehood@convex.com>
Organization: Engineering, Convex Computer Corporation, Richardson, Tx USA
Message-ID: <2qtrn0$n07@lovecraft.convex.com>
Subject: A nice Web site for it \<dtd2html>

Thanks to Glenn Vanderburg of UT Dallas, he has written a nice description
page about dtd2html.  You can access it with the followin URL:

        http://www.utdallas.edu/dtd2html/home.html

He has links to all the examples and to the actual distribution tar files.

I'd like to thank all the other people who have archived the program.
Here's the latest list of locations:

        ftp://ftp.law.indiana.edu/pub/www/
        ftp://ftp.funet.fi/pub/networking/services/www/misc/
        ftp://ftp.utdallas.edu/pub/dtd2html/
        http://caesar.cs.uiowa.edu/~jollie/dtd2html/
        ftp://ftp.ifi.uio.no/pub/SGML/DTD2HTML/
        ftp://ftp.sunet.se/pub/text-processing/sgml/DTD2HTML/
        ftp://ftp.sunet.se/pub/www/utilities/dtd2html/

Note:
    All locations listed above are subject to change, w/o notice, by the
    individual maintainers of each site (ooh ... sounds too legal for me
    :-)

	--ewh
--
    Earl Hood                |   CONVEX Computer Corporation
    ehood@convex.com         |   3000 Waterview Parkway
    Phone: (214) 497-4387    |   P.O. Box 833851
    FAX: (214) 497-4500      |   Richardson, TX  75083-3851
</message>
<message id="<9405121840.AA11060@wpdis03.wpafb.af.mil>" date="2977756856">
Newsgroups: comp.text.sgml
Date: 12 May 1994 18:40:56 UT
From: "Richard G. Littleton" \<rlittlet@wpdis03.wpafb.af.mil>
Message-ID: <9405121840.AA11060@wpdis03.wpafb.af.mil>
References: <24916@dog.cme.nist.gov>
Subject: Re: nesting entities

[Lisa Phillips]

|   I previously sent a message regarding nesting entities and got a lot of
|   feedback from comp.text.sgml readers, all of whom basically told me
|   that it was in fact possible to nest entities in SGML.  Thanks for that
|   feedback.  I did try my code with another parser and it turned out that
|   it was, indeed, an error in the original software that I was using and
|   not a limitation of SGML.
|
|   The question that follows is this: do you model your DTD according to
|   the software that your're using (even if there's a bug -- or as the
|   vendor says a "lack of a feature" in it).  What good is a DTD if you're
|   software doesn't understand it?!

By the same token, what good is a DTD if your software is the ONLY one that
understands it?  Sounds like someone owes you an upgrade.
</message>
<message id="<9405121902.AA47336@source.asset.com>" date="2977758152">
Newsgroups: comp.text.sgml
Date: 12 May 1994 19:02:32 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405121902.AA47336@source.asset.com>
References: <24916@dog.cme.nist.gov>
Subject: Re: nesting entities

Welcome to the wonderful world of features mapping!  Elegance argues in
favor of the "standard solution" with low coupling and high cohesion just
like deMarco says.  However, to pay the bills in the meantime, do what the
software allows and if that isn't enough, start the acquisition for new
software.  I have yet to see the software for which I can't write an
incompatible abstract data type.  A DTD is what it is: a filter to prove or
disprove the well-formedness of the instance.  That you can use it for
other things is a nice byproduct of standard markup, but wholely at the
discretion of the developer.  For that reason, most of us would like to see
SGML tools that work like the Lego blocks of Visual Basic.  Then we could
buy and pay for the pieces we need and snap them together.  For example, a
HyTime engine as a vbx would sure look good under the Christmas tree this
year.  A standard for embedding behavior (scripts) and an interpreter that
was fast as another vbx would look good in the Easter basket.

All this and more..soon we hope.

Len bullard
</message>
<message id="<2qu903$la8@rs6a.wln.com>" date="2977767165">
Newsgroups: comp.text.sgml
Date: 12 May 1994 21:32:45 UT
From: Joe Ford \<fordjb@wln.com>
Message-ID: <2qu903$la8@rs6a.wln.com>
Subject: HTML, RS6000, and this group

I'm looking for RS6000 (AIX) software to develop HTML documents.  Is this
group appropriate?

Does anyone who subscribes have information you'd be willing to share
regarding HTML editors or software for creating HTML documents on an
RS6000.

Thanks in advance.

Joe Ford
</message>
<message id="<2qu8e6$e06@maggie.austin.wireline.slb.com>" date="2977768325">
Newsgroups: comp.text.sgml
Date: 12 May 1994 21:52:05 UT
From: Robert Streich \<streich@tenex.SLCS.SLB.COM>
Organization: Schlumberger Austin Research
Message-ID: <2qu8e6$e06@maggie.austin.wireline.slb.com>
References: <2DCE5E30@noak.vxo.telub.se> <9405092307.AA1636@notes.notes>
Subject: Re: JPEG and MPEG notations

[Ken Holman]

|   Whether this document becomes a TR (Technical Report) or not remains to
|   be seen.  We have solicited input from member bodies in the resolutions
|   of the meeting document N1672:

This is a good idea, Ken. How about making it part of 9573? It seems like
the perfect place for it.

--
Robert Streich			streich@austin.sar.slb.com
Schlumberger			+1 512 331 3318 (voice)
Austin Research			+1 512 331 3760 (fax)
</message>
<message id="<1994May12.223731.12916@ctp.com>" date="2977771051">
Newsgroups: comp.text.sgml
Date: 12 May 1994 22:37:31 UT
From: Eddy Wong \<ewong@ctp.com>
Organization: Cambridge Technology Partners
Message-ID: <1994May12.223731.12916@ctp.com>
Subject: MHEG format

Hi netters,

Has anybody heard about a format called "MHEG"?  All I know is that it's
related to text processing.

Any information would be appreciated.

Thanks,

Eddy.
--
 /\\/\\ | Eddy Wong		ewong@ctp.com 	attmail!riker!ewong 	|
/ /_.\\| Cambridge Technology Partners, Inc.	+1 617 374 8410		|
\\  /./| 304 Vassar St, Cambridge, MA 02139   	+1 617 374 8300 (fax)	|
 \\/\\/ |									|
</message>
<message id="<mbroschi.7.000EF6AD@email.state.ut.us>" date="2977826262">
Newsgroups: comp.text.sgml
Date: 13 May 1994 13:57:42 UT
From: Mike Broschinsky \<mbroschi@email.state.ut.us>
Organization: Utah State Div. of Administrative Rules
Message-ID: \<mbroschi.7.000EF6AD@email.state.ut.us>
Subject: HTML primer: where to obtain?

I am looking for a quick reference to HTML or some sort of primer which I
can print out to use as a reference while creating HTML documents.

Our office currently creates a Folio Infobase version of the _Utah
Adminstrative Code_ (the compilation of all executive branch regulations of
the state of Utah).  I am interested in trying to create an HTML version of
this compilation.

Thanks

--
Mike Broschinsky
mbroschi@email.state.ut.us

"To sleep, perchance to dream..."
</message>
<message id="<memo.106787@cix.compulink.co.uk>" date="2977826538">
Newsgroups: comp.text.sgml
Date: 13 May 1994 14:02:18 UT
From: Richard Parker \<pindar3@cix.compulink.co.uk>
Message-ID: \<memo.106787@cix.compulink.co.uk>
Subject: AAPBK Code list

Hello Netbeings,

Does anyone have a document, or information on the meaning/usage of all the
codes in the AAPBK.dtd?  If so please e-mail me it, or it's location?

Thanx,

Rich
--
Richard Parker             | I speak for myself, not Pindar
Pindar Infotek (York)      | (unless it's profitable)
UK                         |
</message>
<message id="<Cpr498.Avx@mail.auburn.edu>" date="2977838539">
Newsgroups: comp.text.sgml
Date: 13 May 1994 17:22:19 UT
From: Dean Hendrix \<thendrix@eng.auburn.edu>
Organization: Auburn University Engineering
Message-ID: \<Cpr498.Avx@mail.auburn.edu>
Subject: SGML for CASE Tools

I'm working on using SGML as the basis for a program visualization tool.
Does anyone know of other work that uses SGML (or another markup language)
to provide graphical representations of source code (e.g., control
structure diagrams, flow charts, etc.)?

Any postings to this group or personal email on this question would be most
appreciated.

--
Dean Hendrix
Computer Science and Engineering
Auburn University
internet: thendrix@eng.auburn.edu
</message>
<message id="<9405132058.AA24677@wpdis03.wpafb.af.mil>" date="2977851492">
Newsgroups: comp.text.sgml
Date: 13 May 1994 20:58:12 UT
From: "Richard G. Littleton" \<rlittlet@wpdis03.wpafb.af.mil>
Message-ID: <9405132058.AA24677@wpdis03.wpafb.af.mil>
References: \<CpIJv2.HpE@news.Hawaii.Edu> <9405111956.AA18658@wpdis03.wpafb.af.mil> <9405121304.AA16979@source.asset.com>
Subject: Re: Documentation

[Len Bullard]

|   Isn't MIL-M-28001B the DTD that is being quoted for contracting
|   purposes? If there is confusion, it is widespread and spreading faster.
|   I've received several phone calls lately in which I'm asked about tools
|   to support production using the "CALS DTD per MIL-M-28001".  If this is
|   not the DTD to which they are referring, a very strong message should
|   come out of the CALS Policy Office, the ISG EPC or someone.
|
|   Does the Air Force provide or recommend a FOSI for the 38784 DTD?

This is based on how the Air Force does business, the other Services may
differ.

The MIL-M-28001 DTD should NEVER be quoted in a contracting action.  The
only time MIL-M-28001 itself should ever be quoted is if the contracting
action is specifically to build DTDs, not tech manuals.  (Only the
Preparing Activity for a specification should be contracting specifically
to have DTDs built.)  If a contractor is building tech manuals, the only
way they should ever get into MIL-M-28001 is when there is no public DTD
available for the manual(s) they are building.  In that case, MIL-STD-1840
would require them to build a DTD in accordance with MIL-M-28001 (NOT use
the DTD that is in MIL-M-28001).  Here's how the events should unfold.  The
contract calls for delivery of digital data in accordance with the
requirements of MIL-STD-1840 (MIL-M-28001 should never be in the contract
other than by reference from MIL-STD-1840).  MIL-STD-1840 requires the data
be tagged in accordance with appropriate public DTDs (for the Air Force,
the public DTDs are contained in the Appendix(s) to the specification).  If
an appropriate public DTD is available, the contractor uses it to tag the
data.  If an appropriate public DTD is not available, MIL-STD-1840 requires
that the contractor build one in accordance with MIL-M-28001 (again, not to
just use the DTD found in 28001).  In addition, the text of MIL-M-28001
clearly states this DTD is an EXAMPLE.

--
Rich
                               "Only two things are infinite, the
                                universe and human stupidity, and
                                I'm not sure about the former."
                                                  Albert Einstein
</message>
<message id="<2r0v36$8qf@search01.news.aol.com>" date="2977857062">
Newsgroups: comp.text.sgml
Date: 13 May 1994 22:31:02 UT
From: Eliot Kimber \<drmacro@aol.com>
Message-ID: <2r0v36$8qf@search01.news.aol.com>
References: <9405121902.AA47336@source.asset.com>
Subject: Re: Tool limitations (was nesting entities)

There are two different classes of limitations:

1. Outright bugs and non-conformances
2. Non-support of optional features.

In the first case, if you've paid money for your software, and it claims to
be a conforming SGML application, you are owed either a refund or a fix.
There's enough competition in the SGML industry that you do have other
options.

In the second case, you'll have to try to convince your vendor that it is
in their interest to implement support for the optional features you need
and they don't support (such as the campaign by Erik Naggum and myself to
convince vendors to implement the SGML LINK feature).

Some non-conformances are subtle, and may only be revealed by nutcases like
myself who go out of their way to push the limits of what tools can do.
Others are simply bugs.  I try to be patient with the former, but there's
no excuse for the latter.  Well-tested validating parsers have been
available for a number of years now -- no-one can reasonably claim that
they weren't quite sure what the correct behavior was (not to mention that
this newsgroup will answer any such question quickly and thoroughly).

On the other hand, I've only worked with one SGML tool that did everything
it said it was capable of completely correctly, and since that product is
only available on VM/CMS and MVS, it's unlikely you will ever have a chance
to use it.  Every other tool I've used, although generally correct and
functional, has some fatal flaw that makes it, at least under certain
circumstances, non-conforming.  For example, Author/Editor hoses parameter
entities in the internal part of the DTD subset, while Arbortext's SGML
publisher lets you create invalid instances.  InContext provides no
mechanism for mapping public identifiers to system identifiers.  And as far
as I know, OmniMark provides no way to modify its entity resolution
behavior (I could be wrong on this one -- I don't own a copy of OmniMark,
I've only worked with an eval copy).

So you have to make do with what you've got, but you don't have to be quiet
about it.  A key benefit of SGML -- application independence -- is
seriously compromised if your tool won't do things that any SGML tool
should do or creates files that you may reasonably think are valid SGML
documents but are in fact not.

I should say that I have been fairly successful at building an
SGML-centered personal work station where I can use a variety of SGML-based
tools to do a wide variety of tasks from an invariant database of SGML
documents that are, as far as I know, completely uncompromised SGML
documents (in other words, they're the way I want them to be, not the way
any of my applications want them to be, with the exception of some uses of
parameter entities in internal DTD subsets since Author/Editor cannot
manage them correctly).  I use SGML for almost everything I do.  I've
integrated SGML processing with Perl, Visual Basic, and VX/Rexx.  I do this
with freeware and commercial products, and it's getting easier all the
time.

I agree with Len Bullard that a HyTime VBX (or, perhaps even better, a SOM
object) would be a very nice holiday gift.  Don't think it hasn't been
discussed.

There are a lot of ways that, without too much effort, SGML can be
integrated with these very powerful enabling tools (just today I wrote a
little VX/Rexx program to allow me to define public identifiers and SGML
declarations as OS/2 extended attributes, which I then use in a shell exec
to control SGMLS-based processes and Author/Editor mkrls processing (I can
drag a DTD onto an icon and the rules are built and put into the right
directory automatically) -- next I'll define an object type of SGML under
OS/2; from there, it's not too much farther to having an SGML-aware
operating system).

--
\<Address HyTime=bibloc>
W. Eliot Kimber (kimber@passage.com) Systems Analyst and HyTime Consultant
Passage Systems, Inc., 9971 Quail Blvd., Suite 903, Austin TX 78758, +1 512 339 1400
465 Fairchild Dr., Suite 201, Mountain View, CA  94043, +1 415 390 0911
\</Address>
</message>
<message id="<2r0vtf$8vh@search01.news.aol.com>" date="2977857903">
Newsgroups: comp.text.sgml
Date: 13 May 1994 22:45:03 UT
From: Eliot Kimber \<drmacro@aol.com>
Message-ID: <2r0vtf$8vh@search01.news.aol.com>
References: <9405121509.AA48381@source.asset.com>
Subject: Re: Treeloc in HyTime (continued)

[Len Bullard]

|   \<techinfo>
|     \<system>
|	 \<system>
|	    \<descinfo>...\</descinfo>
|	    \<descinfo>...\</descinfo>
|	    \<partinfo>...\</partinfo>
|	 \</system>
|     \</system>
|   \</techinfo>
|  
|   What is the marklist that locates the \<partinfo> element?

\<treeloc id=partinfo-loc>1 1 1 3\</treeloc>

As it happens, I'm trying to work up a modification to Jim Clark's SGMLS.PL
Perl script to add some HyTime-specific functionality, including
calculating the tree location of every element in a document.  It's a
pretty straightforward recursive algorithm.  You keep the current parent's
treeloc in a global variable, and for each child, calculate its child
number and append it to its parent's treeloc.  You can use push-down stacks
for the parent treelocs and child number counters to manage the global
variables as you move up and down the instance tree.

--
\<Address HyTime=bibloc>
W. Eliot Kimber (kimber@passage.com) Systems Analyst and HyTime Consultant
Passage Systems, Inc., 9971 Quail Blvd., Suite 903, Austin TX 78758, +1 512 339 1400
465 Fairchild Dr., Suite 201, Mountain View, CA  94043, +1 415 390 0911
\</Address>
</message>
<message id="<2r2uv1$ldt@search01.news.aol.com>" date="2977922465">
Newsgroups: comp.text.sgml
Date: 14 May 1994 16:41:05 UT
From: Eliot Kimber \<drmacro@aol.com>
Message-ID: <2r2uv1$ldt@search01.news.aol.com>
References: <2r0vtf$8vh@search01.news.aol.com>
Subject: Re: Treeloc in HyTime (continued)

[Eliot Kimber]

|   You can use push-down stacks for the parent treelocs and child number
|   counters to manage the global variables as you move up and down the
|   instance tree.

I discovered last night when I actually implemented this algorithm that the
child stack itself, including the current element's child number, *is* the
tree location of that element (don't know why I didn't see that before).
Therefore, there's no need to keep a separate stack of tree locations.

--
\<Address HyTime=bibloc>
W. Eliot Kimber (kimber@passage.com) Systems Analyst and HyTime Consultant
Passage Systems, Inc., 9971 Quail Blvd., Suite 903, Austin TX 78758, +1 512 339 1400
465 Fairchild Dr., Suite 201, Mountain View, CA  94043, +1 415 390 0911
\</Address>
</message>
<message id="<CpsxGF.9wK@news.cis.umn.edu>" date="2977923014">
Newsgroups: comp.text.sgml
Date: 14 May 1994 16:50:14 UT
From: R A Milowski \<milor001@maroon.tc.umn.edu>
Organization: University of Minnesota
Message-ID: \<CpsxGF.9wK@news.cis.umn.edu>
References: <9405121902.AA47336@source.asset.com> <2r0v36$8qf@search01.news.aol.com>
Subject: Re: Tool limitations (was nesting entities)

Boy, would I like to have LINK features implemented in a commercial tool!
I could finally implement a multi-step conversion process that I have
designed with a particular client in mind...  oh well, not enough time to
develop it myself.

--
R. Alexander Milowski
SGML Researcher
milor001@maroon.tc.umn.edu
</message>
<message id="<19940515.2234@naggum.no>" date="2978007869">
Newsgroups: comp.text.sgml
Date: 15 May 1994 16:24:29 UT
From: Erik Naggum \<erik@naggum.no>
Organization: Naggum Software; +47 2295 0313
Message-ID: <19940515.2234@naggum.no>
References: <24916@dog.cme.nist.gov>
Subject: Re: nesting entities

[Lisa Phillips] (1994-05-12)

|   The question that follows is this: do you model your DTD according to
|   the software that your're using (even if there's a bug -- or as the
|   vendor says a "lack of a feature" in it).  What good is a DTD if you're
|   software doesn't understand it?!

the attitude you present is the _old_ attitude -- where software vendors
and programmers owned the users' data, and thought they had every right to
dictate the terms of its use.  we want to get away from that, and instead
have the vendors and programmers understand that they are nothing without
the data we ask them to process.  it is _our_ data, not theirs.

so, the question _should_ be: what good is the software if it doesn't
understand your data?  to think that data is flexible and software isn't,
is to allow that to become true, and in the process destroy the value of
your data.

(there _are_ legitimate reasons why software may want to limit what it
understands.  if it does, it should also tell you so when it happens, as
opposed to just crash on you, and preferably before it happens, as in the
documentation that the user should be required to read.  software is buggy
and dysfunctional because its users allow it to be, and they are the only
ones who can change that.)

best regards,
\</Erik>
--
Erik Naggum \<erik@naggum.no> \<SGML@ifi.uio.no>  |  memento, terrigena.
ISO 8879 SGML, ISO 10744 HyTime, ISO 10646 UCS  |  memento, vita brevis.
</message>
<message id="<769022215snz@pctrain.demon.co.uk>" date="2978011015">
Newsgroups: comp.text.sgml
Date: 15 May 1994 17:16:55 UT
From: Basil Stephens \<basil@pctrain.demon.co.uk>
Organization: DIS(organised)
Message-ID: <769022215snz@pctrain.demon.co.uk>
Subject: CONREF

Examples I have seen associate the Use of CONREF with the use of no end
element, obviously as defined by the .dtd; why is this, please?  I can't
find anything in Goldfarb so far (;-)) which requires this!

Example

    \<!ELEMENT fred      - o  (#PCDATA)>
    \<!ATTLIST fred      id  IDREF   #CONREF >

Basil
</message>
<message id="<2r5r14INN5sc@smokies.cis.ohio-state.edu>" date="2978016740">
Newsgroups: comp.text.sgml
Date: 15 May 1994 18:52:20 UT
From: William Bradley \<bradley@cis.ohio-state.edu>
Organization: The Ohio State University Dept. of Computer and Info. Science
Message-ID: <2r5r14INN5sc@smokies.cis.ohio-state.edu>
Subject: Databases and/or Object Stores for SGML and/or HTML Documents

Hi,

I am looking for information on projects (commercial or academic) which
revolve around the storage of SGML and/or HTML documents in databases or
object stores.

Any information would be greatly appreciated.

thankX,

Bill B.

--
William Bradley                                 bradley@cis.ohio-state.edu
CIS Dept, Ohio State University                 w/ +1 614 292 7318
228 Bolz Hall, 2036 Neil Ave.                   h/ +1 614 293 0822
Columbus, OH 43210
</message>
<message id="<2r6u5rINNcv2@dubhe.anu.edu.au>" date="2978052731">
Newsgroups: comp.text.sgml
Date: 16 May 1994 04:52:11 UT
From: Ian Macleod \<macleod@qucis.queensu.ca>
Organization: Queen's University at Kingston
Message-ID: <2r6u5rINNcv2@dubhe.anu.edu.au>
Subject: Call for Papers

COMPUTER STANDARDS & INTERFACES

The International Journal on the Development and Application of Standards
for Computers, Data Communications and Interfaces


Editors-in-Chief:

John L. Berg
Harald Schumny


CALL FOR PAPERS:

Special Issue on SGML

Guest Editor:

Ian A. Macleod
Department of Computing and Information Science
Queen's University
Kingston
Ontario K7L 3N6
CANADA

Email: macleod@qucis.queensu.ca
Phone: (613)-545-6059
FAX: (613)-545-6513

SGML (the Standardized General Markup Language) is an international
standard whose importance is rapidly growing.  It is fair to say that the
era of electronic text has finally arrived.  A large number of potential
text applications are seeking solutions, and there is significant
industrial interest in the technologies being developed in the SGML
context.

In view of the high importance of SGML, Computer Standards and Interfaces
is planning a special issue on this topic to be published in mid 1995.  The
goal is to collect papers incorporating important advances in the field.

Topics of interest include, but are not limited to, the following:

-- Novel applications of SGML
-- SGML databases and information retrieval
-- Languages for accessing and manipulating SGML structures
-- Hypertext/Hypermedia
-- Entity management
-- Visualisation and SGML
-- Related standards and SGML
-- Converting legacy databases to SGML
-- Tools for developing and using DTDs

Papers should be sent to the address above.

They should be no longer than 25 pages double spaced in a font size of at
least 12 pouints.

IMPORTANT DATES:

31 August 1994: deadline for paper submission

31 October 1994: notification of preliminary results

15 December 1994: deadline for submission of revised papers

15 February 1995: Notification of final acception/rejection

1 April 1995:  deadline for having the paper to the publisher.
</message>
<message id="<2r71dj$t1d@ricrdc2>" date="2978056051">
Newsgroups: comp.text.sgml
Date: 16 May 1994 05:47:31 UT
From: Shigeya Senda \<senda@ipe.rdc.ricoh.co.jp>
Organization: Information\&Communication R\&D Center, RICOH Co., Ltd.
Message-ID: <2r71dj$t1d@ricrdc2>
Subject: [Q] question about marked section

Hi.

I have a question in marked section grammer.

At "status keyword specification" in marked section declaration, we can use
parameter entity references (and cannot use general entity references) like
following:

    \<!ENTITY %confiden "INCLUDE">
    \<!ENTITY %public   "IGNORE">

    \<![ %confiden; [ ..... included ...... \]]>
    \<![ %public; [ ..... not included ...... \]]>

Why parameter entity?  I feel it is natural to use general entity reference
in document instance part.

Thanks.

--
+--------------------------------------------------------------------+
|   RICOH RESEACH AND DEVELOPMENT CENTER          Shigeya SENDA      |
|   ($B3t(B)$B%j%3!<(B  $B8&3+K\\(B $B>pDL8&(B $B>pJs8&(BC 421$B8&5f<<(B		$B@iED<"Li(B     |
+--------------------------------------------------------------------+
</message>
<message id="<2r7a46$dkn@osiris.wu-wien.ac.at>" date="2978064966">
Newsgroups: comp.text.sgml
Date: 16 May 1994 08:16:06 UT
From: Rony Flatscher \<Rony.Flatscher@wu-wien.ac.at>
Organization: University of Economics and Business Administration
Message-ID: <2r7a46$dkn@osiris.wu-wien.ac.at>
Subject: IBMIDDOC.BOO and INFOMAST.BOO - not readable ?

IBMIDDOC.BOO and INFOMAST.BOO cannot be read with readibm/2 (OS/2 version);
i.e., book-details are displayable, the book itself not; the error message
indicates that it cannot be read with the actual (am using the latest
commercially available reader) version of readibm/2.

Does anyone have any ideas what to do ?

---rony
</message>
<message id="<2r7f2f$p7q@rs18.hrz.th-darmstadt.de>" date="2978070031">
Newsgroups: comp.infosystems.www,comp.text.sgml,comp.lang.perl
Date: 16 May 1994 09:40:31 UT
From: Joachim Schrod \<schrod@iti.informatik.th-darmstadt.de>
Organization: TH Darmstadt, FG Systemprogrammierung
Message-ID: <2r7f2f$p7q@rs18.hrz.th-darmstadt.de>
References: <2qtrn0$n07@lovecraft.convex.com>
Subject: Re: dtd2html (A nice Web site for it)

[Earl Hood]

|   Here's the latest list of locations:
|
|   	ftp://ftp.law.indiana.edu/pub/www/
|   	ftp://ftp.funet.fi/pub/networking/services/www/misc/
|   	ftp://ftp.utdallas.edu/pub/dtd2html/
|   	http://caesar.cs.uiowa.edu/~jollie/dtd2html/
|   	ftp://ftp.ifi.uio.no/pub/SGML/DTD2HTML/
|   	ftp://ftp.sunet.se/pub/text-processing/sgml/DTD2HTML/
|   	ftp://ftp.sunet.se/pub/www/utilities/dtd2html/

Add

	ftp://ftp.th-darmstadt.de/pub/text/sgml/DTD2HTML

Thanks for this nice tool.

Joachim
[THD SGML Archive, maintainer]

--
Joachim Schrod			Email: schrod@iti.informatik.th-darmstadt.de
Computer Science Department
Technical University of Darmstadt, Germany
</message>
<message id="<19940516.2247@naggum.no>" date="2978075546">
Newsgroups: comp.text.sgml
Date: 16 May 1994 11:12:26 UT
From: Erik Naggum \<erik@naggum.no>
Organization: Naggum Software; +47 2295 0313
Message-ID: <19940516.2247@naggum.no>
References: <2r71dj$t1d@ricrdc2>
Subject: Re: [Q] question about marked section

[Shigeya Senda]

|   I have a question in marked section grammer.
|
|   At "status keyword specification" in marked section declaration, we can
|   use parameter entity references (and cannot use general entity
|   references) ...
:
|   Why parameter entity?  I feel it is natural to use general entity
|   reference in document instance part.

a parameter entity reference is required because the keyword is a parameter
in the marked section declaration.

similar arguments apply to USELINK and USEMAP declarations occurring in the
document instance.

this _does_ have the undesirable side-effect of not being able to separate
general entities and parameter entities as user and programmer entities,
respectively.  in consequence, to believe that the two are different in
this manner is not very helpful.  the other side of this issue is that
document types frequently include scores of entity declarations for special
characters, which encroach on the "user" name space.

this may be a general issue concerning name spaces that is worth pursuing.

best regards,
\</Erik>
--
Erik Naggum \<erik@naggum.no> \<SGML@ifi.uio.no>  |  memento, terrigena.
ISO 8879 SGML, ISO 10744 HyTime, ISO 10646 UCS  |  memento, vita brevis.
</message>
<message id="<2r7l8c$mpi@search01.news.aol.com>" date="2978076364">
Newsgroups: comp.text.sgml
Date: 16 May 1994 11:26:04 UT
From: Eliot Kimber \<drmacro@aol.com>
Message-ID: <2r7l8c$mpi@search01.news.aol.com>
References: <2r7a46$dkn@osiris.wu-wien.ac.at>
Subject: Re: IBMIDDOC.BOO and INFOMAST.BOO - not readable?

[Rony Flatscher]

|   IBMIDDOC.BOO and INFOMAST.BOO cannot be read with readibm/2 (OS/2
|   version); i.e., book-details are displayable, the book itself not; the
|   error message indicates that it cannot be read with the actual (am
|   using the latest commercially available reader) version of readibm/2.

Make sure you've transfered those files in binary mode from the FTP site.
I can read them with BookManager/DOS (I don't have Read/2 on my machine).
If you're sure you've transfered them correctly and still can't read them,
send me a note and I'll see if I can find the problem (Wayne Wohler is in
Europe this week and probably isn't checking his mail).  I won't have FTP
access for a couple of days, so I can't try grabbing them myself.  Note
that these BookManager books cannot be read with the IBM Library Reader
that comes with the documentation CD-ROMs that IBM distributes.

--
\<Address HyTime=bibloc>
W. Eliot Kimber (kimber@passage.com) Systems Analyst and HyTime Consultant
Passage Systems, Inc., 9971 Quail Blvd., Suite 903, Austin TX 78758, +1 512 339 1400
465 Fairchild Dr., Suite 201, Mountain View, CA  94043, +1 415 390 0911
\</Address>
</message>
<message id="<9405161250.AA28352@source.asset.com>" date="2978081411">
Newsgroups: comp.text.sgml
Date: 16 May 1994 12:50:11 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405161250.AA28352@source.asset.com>
References: \<Cpr498.Avx@mail.auburn.edu>
Subject: Re: SGML for CASE Tools

[Dean Hendrix]

|   I'm working on using SGML as the basis for a program visualization
|   tool.  Does anyone know of other work that uses SGML (or another markup
|   language) to provide graphical representations of source code (e.g.,
|   control structure diagrams, flow charts, etc.)?

Not for source code.  However, Knowledge Based Systems, Inc. at College
Station Texas (+ 1 409 260 5274 (last number I have)) used an SGML DTD to
represent IDEF method diagrams.  We did some demonstrations for the US Army
using the Army's IADS product in which Paintbrush diagrams were used to
create navigation paths through an SGML hypertext method.  At one time, it
was proposed that VISIO software be used in conjunction with IADS for a
more automated process.  I wrote some work for General Electric that used
SGML to represent Hatley-Pirbhai methods, but we didn't generate the
diagram from the SGML; as with the Army work,we used it to represent the
internal data which the diagrams visualized.

Do you plan to use xenoforms, e.g., wrapping C with SGML elements and
notations for the interpreter call, or actually represent code in the SGML
syntax?  We have looked at using SGML for CASE tools here and may have some
ideas you can use.

Len Bullard - Unisys
</message>
<message id="<49@markj.win.net>" date="2978086054">
Newsgroups: comp.text.sgml
Date: 16 May 1994 14:07:34 UT
From: "Mark T. Jacobson" \<markj@markj.win.net>
Message-ID: <49@markj.win.net>
Subject: Need Sample Instances of 12083 Book & Article

Can someone tell me where I might find sample instances of a book chapter
and an article marked up using the 12083 standard?  I am new to the net and
have never downloaded anything so I might need some help with that as well.
</message>
<message id="<50@markj.win.net>" date="2978086562">
Newsgroups: comp.text.sgml
Date: 16 May 1994 14:16:02 UT
From: "Mark T. Jacobson" \<markj@markj.win.net>
Message-ID: <50@markj.win.net>
Subject: Author/Editor and 12083 DTDs

I am trying to run RulesBuilder to complie the 12083 Book DTD.  I am
running the Windows version.  I followed the instructions to comment out
the \<!DOCTYPE PUBLIC...[ entry at the top of the DTD and the ]> at the
bottom, but I keep getting an error message indicating that points to the
end of the DTD indicating that an unexpected character was found and that
< is expected.  Can someone help me figure how to get this DTD to compile?
</message>
<message id="<1994May16.145935.9648@xgml.com>" date="2978089175">
Newsgroups: comp.text.sgml
Date: 16 May 1994 14:59:35 UT
From: "Eric R. Skinner" \<ers@xgml.com>
Organization: Exoterica Corporation
Message-ID: <1994May16.145935.9648@xgml.com>
References: <9405121902.AA47336@source.asset.com> <2r0v36$8qf@search01.news.aol.com>
Subject: Re: Tool limitations (was nesting entities)

[Eliot Kimber]

|   Every other tool I've used, although generally correct and functional,
|   has some fatal flaw that makes it, at least under certain
|   circumstances, non-conforming.  [...] OmniMark provides no way to
|   modify its entity resolution behavior (I could be wrong on this one --
|   I don't own a copy of OmniMark, I've only worked with an eval copy).

So far OmniMark has provided the following in this area:

- Control over resolution of text entities (such as external entity
  references in the instance), through its reconfigurable LIBRARY rule.
  Either the programmer or the user can specify the mapping of public
  identifiers to system file names and where in the file system to look for
  these files.

- Extensive control over resolution of data entities (such as graphics
  attribute entities) through the EXTERNAL-ENTITY rule.  You can write
  rules to specify processing of specific entities, all entities with a
  certain notation, with tests of data attributes, etc.  Within the body of
  the rule you have full access to the system identifier, public
  identifier, notation name, the notation's system and public identifiers,
  and suggested resolution.  The action you take is then up to you; it
  could involve file i/o, or any other OmniMark manipulation.

In OmniMark V2R4, announced today at SGML Europe for shipment in Q3 1994,
the significant SGML-to-SGML enhancements include a new
EXTERNAL-TEXT-ENTITY rule providing all the flexibility previously only
available for data entities.  Among other capabilities, this new rule can
support the upcoming SGML Open Entity Mapping format directly in OmniMark
source code, and allows programmers to write their own "entity managers".

--
Eric R. Skinner                          ers@exoterica.com
Exoterica Corporation                  Tel +1 613 722 1700
Ottawa, Canada                         Fax +1 613 722 5706
Product information:                    info@exoterica.com
</message>
<message id="<9405161509.AA28642@source.asset.com>" date="2978089782">
Newsgroups: comp.text.sgml
Date: 16 May 1994 15:09:42 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405161509.AA28642@source.asset.com>
References: <9405121509.AA48381@source.asset.com> <2r0vtf$8vh@search01.news.aol.com>
Subject: Re: Treeloc in HyTime

[Eliot Kimber]

|   \<treeloc id=partinfo-loc>1 1 1 3\</treeloc>

Thanks for the reply, Eliot.  So, the treeloc doesn't care about the
recursion of the system element.  It strictly expresses a location as a
classical level of indenture starting with the root element as 1.  Given
that the tree could contain a large number of levels, the alternative would
be a pathloc which does not require complete expression of the indenture.
Is there a way to express recursion more succinctly, i.e, there could be a
more compact address?  I suppose one could kluge a method that uses an
attribute to count levels and a loc to get that value, but that seems
Byzantine?

Len
</message>
<message id="<9405161512.AA22556@source.asset.com>" date="2978089925">
Newsgroups: comp.text.sgml
Date: 16 May 1994 15:12:05 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405161512.AA22556@source.asset.com>
Subject: SOM

Pardon my ignorance.  What is an SOM object?

Len
</message>
<message id="<16FB8107D6S85.UPP201@ibm.rhrz.uni-bonn.de>" date="2978095542">
Newsgroups: comp.text.sgml
Date: 16 May 1994 16:45:42 UT
From: Michael Burschik \<UPP201@ibm.rhrz.uni-bonn.de>
Organization: RHRZ Uni-Bonn
Message-ID: <16FB8107D6S85.UPP201@ibm.rhrz.uni-bonn.de>
Subject: SGML, HTML and LaTeX

There seem to be some tools that convert SGML or HTML documents into LaTeX
for printing.  What is the most recommendable package?  Are there any tools
for converting LaTeX files into SGML or HTML?

Cheers,
        Mike.
</message>
<message id="<masonjd.9.2DD7A9EB@ornl.gov>" date="2978097259">
Newsgroups: comp.text.sgml
Date: 16 May 1994 17:14:19 UT
From: James David Mason \<masonjd@ornl.gov>
Organization: Convenor, ISO/IEC JTC1/SC18/WG8
Message-ID: \<masonjd.9.2DD7A9EB@ornl.gov>
References: \<Robbin.9.00109A84@[140.138.150.213]>
Subject: Re: How can l get information about SMSL?

[Robbin Wong]

|   How can I get information about SMSL?
|   
|   The Standard Multimedia Scripting Language (SMSL) is a joint project
|   between SC18/WG8, the developers of SGML and HyTime, and SC29/WG12, the
|   Multimedia /Hypermedia Expert Grp (MHEG) which start in 1992.
|
|   If you know something about the Standard,please reply to me.
|
|   Reply will be appreciate and thank you for your reading.

SMSL is a joint project of ISO/IEC JTC1/SC18/WG8, the creators of SGML and
HyTime, and ISO/IEC JTC1/SC29/WG12, who are a coding group, formerly in
ISO/IEC JTC1/SC2, the creators of ISO 646 and ISO 10646.  The original
agreement was that SC18/WG8 would write a functional specification and
SC29/WG12 would do an encoding.  It appears at the moment that, while the
two groups maintain liaison, they are not really working on the same
project.  WG8 is interested in creating (or adopting) a scripting LANGUAGE
that can describe processes as well as results, and SC29/WG12 is interested
in assembling a set of control codes that can describe a static result to
be displayed over some sort of specialized terminal (e.g., a later version
of the French Minitel).  In other words, WG8 wants a tool for describing
all sorts of multimedia products, and WG12 wants to specifiy a
tlecommunications service protocol.

WG8 has destributed an internal draft of SMSL and received comments on it.
The editor is preparing a new text that should go out to be registered and
balloted as a CD (committee draft, the level before DIS, or draft
international standard) sometime this summer.  The editor is:

Mr. Brian Markey
Permanent Wave Productions
18 Harris Ave.
Shrewsbury, MA  01545
 U.S.A.
Telephone: +1 508 842 4422
Network: bmarkey@world.std.com

If you are interested in participating in the SMSL project, the next
meeting of WG8 will be in Oslo, Norway, 24-30 August 1994.  Contact you
national standards body to get accredited to the meetitng.

(Mr. Wong, the contact for WG8 in Taiwan is
Dr. Ching-Chun Hsieh
Academica Sinica
Institute of Information Science
P.O. Box 1-7, Nankang
Taipei,      TAIWAN
Telephone: +886 2 782 5472
Facsimile: +886 2 783 6444
Network: hsieh@twnas886.bitnet)

--
Dr. James D. Mason
(WG8 Convenor)
Oak Ridge National Laboratory
Information Management Services
Bldg. 2506, M.S. 6302, P.O. Box 2008
Oak Ridge, TN  37831-6302   U.S.A.
Telephone: +1 615 574 6973
Facsimile: +1 615 574 6983
Network: masonjd@ornl.gov
</message>
<message id="<9405161736.AA31933@source.asset.com>" date="2978098565">
Newsgroups: comp.text.sgml
Date: 16 May 1994 17:36:05 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405161736.AA31933@source.asset.com>
References: <9405121902.AA47336@source.asset.com> <2r0v36$8qf@search01.news.aol.com> <1994May16.145935.9648@xgml.com>
Subject: Re: Tool limitations (was nesting entities)

[Eric Skinner]

|   this new rule can support the upcoming SGML Open Entity Mapping format
|   directly...

Can you describe what the upcoming SGML Open Entity Mapping format is and
why users would want to support it other than writing entity managers?  I
find myself uncomfortable with the notions of non-ISO extensions to the
standard, but that may not be what this mapping format is.  Please
exemplify.  For example, since so much of HyTime relies on the entity
processing, what does this format do to support such processing?  Does the
format limit what we do with entities, make it more explicit or
transportable, speed up the processing, or limit how one uses entities?

Truly uninformed.

Len Bullard
</message>
<message id="<masonjd.17.2DD7B01B@ornl.gov>" date="2978098843">
Newsgroups: comp.text.sgml
Date: 16 May 1994 17:40:43 UT
From: James David Mason \<masonjd@ornl.gov>
Organization: Convenor, ISO/IEC JTC1/SC18/WG8
Message-ID: \<masonjd.17.2DD7B01B@ornl.gov>
References: <1994May12.223731.12916@ctp.com>
Subject: Re: MHEG format

[Eddy Wong]

|   Hi netters,
|
|   Has anybody heard about a format called 'MHEG'?  All I know is that
|   it's related to text processing.
|
|   Any information would be appreciated.

MHEG is a group, not a data format (at least at this time).

MHEG =3D Multimedia/ Hypermedia Experts Group

MHEG is pretty much synonymous with ISOIEC JTC1/SC29/WG12. They are
developing a static representation (perhaps for a telecommunications
tariffed service) of a multimedia document.  The chairman of SC28/WG12 is

Mme. Fran=E7oise Cola=EFtis
Convenor, ISO/IEC JTC1/SC29/WG12
CCETT
4, rue du Clos-Courtel
B.P. 59
Cesson-S=E9vign=E9 , CEDEX  35512   FRANCE
Telephone: +33 99 02 41 11
Telex: 740 284 F

The related project in the SGML/HyTime world is SMSL, for which, see
earlier postings.

--
Dr. James D. Mason
(ISO/IEC JTC1/SC18WG8 Convenor)
Oak Ridge National Laboratory
Information Management Services
Bldg. 2506, M.S. 6302, P.O. Box 2008
Oak Ridge, TN  37831-6302   U.S.A.
Telephone: +1 615 574 6973
Facsimile: +1 615 574 6983
Network: masonjd@ornl.gov
</message>
<message id="<94136.124700U35395@uicvm.uic.edu>" date="2978099220">
Newsgroups: comp.text.sgml
Date: 16 May 1994 17:47:00 UT
From: C. M. Sperberg-McQueen \<U35395@uicvm.uic.edu>
Organization: University of Illinois at Chicago, ADN Computer Center
Message-ID: <94136.124700U35395@uicvm.uic.edu>
Subject: TEI guidelines published

Readers of this list/group will, I hope, be interested in the following
announcement; please feel free to re-post it to other appropriate
lists and groups.  Thanks. -CMSMcQ

-----

           TEXT ENCODING INITIATIVE PUBLISHES GUIDELINES

On May 16, the Text Encoding Initiative (TEI) publishes its
"Guidelines for Electronic Text Encoding and Interchange."

This report is the product of several years' work by over a hundred experts
in fields ranging from computational linguistics to Ancient Greek
literature.  The Guidelines define a format in which electronic text
materials can be stored on, or transmitted between, any kind of computer
from a personal microcomputer to a university mainframe.  The format is
independent of the proprietary formats used by commercial software
packages.

The TEI came into being as the result of the proliferation of mostly
incompatible encoding formats, which was hampering cooperation and reuse of
data among researchers and teachers.  Creating good electronic texts is an
expensive and time-consuming business.  The object of the TEI was to ensure
that such texts, once created, could continue to be useful even after the
systems on which they were created had become obsolete.  This requirement
is a particularly important one in today's rapidly evolving computer
industry.

To make them "future-proof", the TEI Guidelines use an international
standard for text encoding known as SGML, the Standard Generalized Markup
Language.  SGML was originally developed by the publishing industry as a
way of reducing the costs of typesetting and reuse of electronic
manuscripts but has since become widely used by software developers,
publishers, and government agencies.  It is one of the enabling
technologies which will help the new Digital Libraries take shape.

The TEI Guidelines go beyond many other SGML applications currently in use.
Because they aim to serve the needs of researchers as well as teachers and
students, they have a particularly ambitious set of goals.  They must be
both easily extensible and easily simplified.  And their aim is to specify
methods capable of dealing with all kinds of texts, in all languages and
writing systems, from any period in history.

Consequently, the TEI Guidelines provide recommendations not only for the
encoding of prose texts, but also for verse, drama, and other performance
texts, transcripts of spoken material for linguistic research,
dictionaries, and terminological data banks.

The Guidelines provide detailed specifications for the documentation of
electronic materials, their sources, and their encoding.  These
specifications will enable future librarians to catalogue electronic texts
as efficiently and reliably as they currently catalogue printed texts.

The TEI Guidelines also provide optional facilities which can be added to
the set of basic recommendations.  These include methods for encoding
hypertext links, transcribing primary sources (especially manuscripts),
representing text-critical apparatus, analyzing names and dates,
representing figures, formulae, tables, and graphics, and categorizing of
texts for corpus-linguistic study.  The Guidelines also define methods of
providing linguistic, literary, or historical analysis and commentary on a
text and documenting areas of uncertainty or ambiguity.


The TEI Guidelines have been prepared over a six-year period with grant
support from the U.S. National Endowment for the Humanities, Directorate
General XIII of the Commission of the European Union, the Andrew W. Mellon
Foundation, and the Social Science and Humanities Research Council of
Canada.  The effort is largely the product of the volunteer work of over a
hundred researchers who donated time to share their experience in using
computers and to work out the specific recommendations in the Guidelines.

The project is sponsored by three professional societies active in the area
of computer applications to text-based research: the Association for
Computers and the Humanities, the Association for Literary and Linguistic
Computing, and the Association for Computational Linguistics, which have a
combined membership of thousands of scholars and researchers worldwide.

Many projects in North America and Europe have already declared their
intention of applying the TEI Guidelines in the creation of the large scale
electronic textual resources which are increasingly dominating the world of
humanities scholarship.

The Guidelines are available in paper form or electronic form over the
Internet.  For more information contact the TEI editors by e-mail at
tei@uic.edu or lou@vax.ox.ac.uk. Orders may be placed at the TEI offices in
Chicago, Oxford or Chiba, addresses of which follow:

In Europe: TEI Orders, Oxford University Computing Services,
        13 Banbury Road, Oxford OX2 6NN, UK
        fax +44 865 273275

In East Asia: Prof. Syun Tutiya, Department of Philosophy
        Chiba University, 1-33 Yayoi-cho Inage-ku
        Chiba 263, Japan
        fax: +81 43 290 2287

Rest Of World:  C. M. Sperberg McQueen, University of Illinois at
        Chicago, Academic Computing Center (M/C 135), 1940 W. Taylor,
        Rm. 124, Chicago IL  60612-7352, USA
        fax: +1 312 668 6834
</message>
<message id="<Cpws9r.5wA@news.cis.umn.edu>" date="2978102928">
Newsgroups: comp.text.sgml
Date: 16 May 1994 18:48:48 UT
From: R A Milowski \<milor001@maroon.tc.umn.edu>
Organization: University of Minnesota
Message-ID: \<Cpws9r.5wA@news.cis.umn.edu>
References: <9405161512.AA22556@source.asset.com>
Subject: Re: SOM

[Len Bullard]

|   Pardon my ignorance.  What is an SOM object?

I believe it stands for "Standard Object Model".  It is a "defacto
standard" developed by IBM for standardize object representation so objects
can be shared by applications.  I think.... :)

--
R. Alexander Milowski
SGML Researcher
milor001@maroon.tc.umn.edu
</message>
<message id="<9405162020.AA04719@wpdis03.wpafb.af.mil>" date="2978108424">
Newsgroups: comp.text.sgml
Date: 16 May 1994 20:20:24 UT
From: "Richard G. Littleton" \<rlittlet@wpdis03.wpafb.af.mil>
Message-ID: <9405162020.AA04719@wpdis03.wpafb.af.mil>
References: \<CpIJv2.HpE@news.Hawaii.Edu> <9405111956.AA18658@wpdis03.wpafb.af.mil> <9405121304.AA16979@source.asset.com> <9405121435.AA02475@tecnet1.jcte.jcs.mil>
Subject: Re: Documentation

[Len Bullard]

|   Does the Air Force provide or recommend a FOSI for the 38784 DTD?

[Debby Young]

|   I don't know about the Air Force providing a FOSI (I haven't found one yet)
|   but the Naval Air community will be finalizing the FOSI for 38784C for our
|   use by the end of June.  The Naval Air Electronic TM Team has also produced
|   an IDEF model which defines the management process for DTDs and FOSIs,
|   including configuration control, improving, and testing.  When the FOSI is
|   finalized, we will place it in the public domain.

For Air Force manuals, you should use the DTD that is appended to the
specification, never the 28001/38784B DTD.

The reason the 38784 DTD does not work for JCALS is because they are trying
to use it for legacy data.  Anyone who has dealt with tech manuals (at
least in the AF) knew this DTD would not support legacy data since most
manuals are unique unto themselves or their series.  The 38784C (and draft
D) DTDs are intended to be used on manuals when the spec is actually
followed.

The AF does indeed have a (tri-Service) FOSI for 38784D on the AFCPO BBS.
There are also several test suites on this BBS that contain a DTD, FOSI,
Tagged Instance, and a Word Perfect 5.1 Document to compare against
processing system output.

Rich
</message>
<message id="<19940516.2270@naggum.no>" date="2978109412">
Newsgroups: comp.text.sgml
Date: 16 May 1994 20:36:52 UT
From: Erik Naggum \<erik@naggum.no>
Organization: Naggum Software; +47 2295 0313
Message-ID: <19940516.2270@naggum.no>
References: <94136.124700U35395@uicvm.uic.edu>
Subject: Re: TEI guidelines published

[C. M. Sperberg-McQueen]

|	       TEXT ENCODING INITIATIVE PUBLISHES GUIDELINES
|
|   On May 16, the Text Encoding Initiative (TEI) publishes its
|   "Guidelines for Electronic Text Encoding and Interchange."

the files are available in FTP.IFI.UIO.NO:/pub/SGML/TEI with the original
file names from LISTSERV@UICVM.UIC.EDU, except for the space between name
and type, which is a dot here.  you may want to retrieve the FILELIST first.

best regards,
\</Erik>
--
Erik Naggum \<erik@naggum.no> \<SGML@ifi.uio.no>  |  memento, terrigena.
ISO 8879 SGML, ISO 10744 HyTime, ISO 10646 UCS  |  memento, vita brevis.
</message>
<message id="<9405162118.AA31112@source.asset.com>" date="2978111910">
Newsgroups: comp.text.sgml
Date: 16 May 1994 21:18:30 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405162118.AA31112@source.asset.com>
References: \<CpIJv2.HpE@news.Hawaii.Edu> <9405111956.AA18658@wpdis03.wpafb.af.mil> <9405121304.AA16979@source.asset.com> <9405121435.AA02475@tecnet1.jcte.jcs.mil> <9405162020.AA04719@wpdis03.wpafb.af.mil>
Subject: Re: Documentation

Thanks, Rich for setting me straight on the Air Force position and the spec
position.  Most of my work, at this time, involves IETMs and the 87269 DTD.
However, I try to stay current with the CALS SGML as most of the new legacy
is coming from there |-).  My problems with the CALS DTD examples stem from
the liberal way that parameter entities were used to include whole chunks
of datatypes that allowed strange contents (e.g., DoDLists in footnotes).
It seemed to me that in order to prevent the proliferation of DTDs and to
allow every exception possible, the DTD example became very very loose.
When taught to novices, it was often not explained the extent to which the
entities should be tailored in order to prevent this.  Otherwise, the parse
isn't doing much to help one control content.  One could argue that this is
not its place, but I contend that if the attempt to control the markup
doesn't extend to restricting the conditions allowed by type def, we aren't
using the parser for much that is significant.

Also, very frankly, I believe that the perception of the difficulty of
writing DTDs is very much overblown.  In one sense, the DTD controls
markup, but in another sense, it allows us to tailor that control to enable
local conditions to prevail.  Where this goes against the tradition of
centralized control of all things important that prevails in military
applications, it is very much in the spirit of distributed control of all
things that can only be sensibly decided locally.  Somewhere between these
notions is practical approach to using SGML across very large enterprises,
of which the services are an outstanding example.  If more energy were
expended on educating the user population in the basics and tools of SGML,
time were allowed for them to gain experience, a better method presented,
less money would be spend on committee meetings, consultants, and
contractors.

As I have profited from all of these experiences, as have most of my
professional peers, perhaps that is not a survival position; but as one who
loves the language and sees it as a potentially large contributor to global
well-being, I can only hope we are replaced by a much larger base of
well-educated, experienced practicioners.

Len Bullard
</message>
<message id="<1994May16.214829.198@ousrvr.oulu.fi>" date="2978113709">
Newsgroups: comp.text.sgml,comp.text
Date: 16 May 1994 21:48:29 UT
From: Janne Himanka \<shem@syy.oulu.fi>
Reply-To: Janne.Himanka@oulu.fi
Organization: Tilde's Home Bakery
Message-ID: <1994May16.214829.198@ousrvr.oulu.fi>
References: <94136.124700U35395@uicvm.uic.edu>
Subject: Re: TEI guidelines published

The files are browsable via WWW and searchable via WAIS at

http://www.oulu.fi/TEI.html

This is just a quick hack I put together to aid myself in perusing the TEI
documentation.  I hope someone builds a pretty book interface to the TEI
documentation soon.

Janne Himanka
--
      Verba iactas           + shem@syy.oulu.fi (MIME)
      ut imber aquas         + Oulu University, Dept. of Library & Inf. Sc.
      at cave! vacca volat.  + http://www.oulu.fi/jh.html
</message>
<message id="<2r9fk8$27s@usenet.rpi.edu>" date="2978136136">
Newsgroups: comp.text.sgml
Date: 17 May 1994 04:02:16 UT
From: "Todd G. Hivnor" \<hivnot@vccnorth11.its.rpi.edu>
Organization: Rensselaer Polytechnic Institute, Troy NY
Message-ID: <2r9fk8$27s@usenet.rpi.edu>
References: \<mbroschi.7.000EF6AD@email.state.ut.us>
Subject: Re: HTML primer: where to obtain?

[Mike Broschinsky]

|   I am looking for a quick reference to HTML or some sort of primer which
|   I can print out to use as a reference while creating HTML documents.

Well, you can't quite print these out, but:

    http://oneworld.wa.com/htmldev/devpage/dev-page1.html#doc-a
    http://www.ncsa.uiuc.edu/demoweb/html-primer.html

|   Our office currently creates a Folio Infobase version of the _Utah
|   Adminstrative Code_ (the compilation of all executive branch
|   regulations of the state of Utah).  I am interested in trying to create
|   an HTML version of this compilation.

Somewhere out on the net is a .FFF <--> SGML convertor, which runs on DOS.
That still leaves some work for you, but its just SGML <--> SGML, so it
shouldn't be bad.  I found it a while ago, but don't remember where it was.
If you can't find it drop me some mail & check around my office.

hivnot@rpi.edu
</message>
<message id="<2r9g7r$2in@usenet.rpi.edu>" date="2978136763">
Newsgroups: comp.text.sgml
Date: 17 May 1994 04:12:43 UT
From: "Todd G. Hivnor" \<hivnot@vccnorth11.its.rpi.edu>
Organization: Rensselaer Polytechnic Institute, Troy NY
Message-ID: <2r9g7r$2in@usenet.rpi.edu>
References: <16FB8107D6S85.UPP201@ibm.rhrz.uni-bonn.de>
Subject: Re: SGML, HTML and LaTeX

[Michael Burschik]

|   Are there any tools for converting LaTeX files into SGML or HTML?

la2html is a perl script which converts from LaTeX to HTML.  It only runs
on UNIX, and requires some extra software for bitmap convertsion etc.

http://cbl.leeds.ac.uk/nikos/tex2html/doc/latex2html/latex2html.html

ftp://src.doc.ic.ac.uk/computing/information-systems/www/tools/
  translators/latex2html/latex2html.gz

hivnot@rpi.edu
</message>
<message id="<2r9jjc$6g5@search01.news.aol.com>" date="2978140204">
Newsgroups: comp.text.sgml
Date: 17 May 1994 05:10:04 UT
From: Eliot Kimber \<drmacro@aol.com>
Message-ID: <2r9jjc$6g5@search01.news.aol.com>
References: <50@markj.win.net>
Subject: Re: Author/Editor and 12083 DTDs

[Mark T. Jacobson]

|   I keep getting an error message indicating that points to the end of
|   the DTD indicating that an unexpected character was found and that < is
|   expected.  Can someone help me figure how to get this DTD to compile?

This error can be caused by having a ctrl-Z character at the end of the
document, which for some reason, mkrls refuses to treat as an EOF character
and instead, realizing that ctrl-Z is not declared in the SGML declaration,
complains that it has found a bad character, chokes and dies.  Poor form.

Try editing the DTD with Windows Write or the DOS editor and save it.
Under OS/2, if you edit DTDs with the Enhanced Editor, you need to save
with the /ne option (save /ne from the command dialog).  Took me a while to
figure that one out.

--
\<Address HyTime=bibloc>
W. Eliot Kimber (kimber@passage.com) Systems Analyst and HyTime Consultant
Passage Systems, Inc., 9971 Quail Blvd., Suite 903, Austin TX 78758, +1 512 339 1400
465 Fairchild Dr., Suite 201, Mountain View, CA  94043, +1 415 390 0911
\</Address>
</message>
<message id="<2r9jqr$6hm@search01.news.aol.com>" date="2978140443">
Newsgroups: comp.text.sgml
Date: 17 May 1994 05:14:03 UT
From: Eliot Kimber \<drmacro@aol.com>
Message-ID: <2r9jqr$6hm@search01.news.aol.com>
References: <1994May16.145935.9648@xgml.com>
Subject: Re: Tool limitations (was nesting entities)

In the first post with this subject, I mentioned that InContext did not
provide a way to map public identifiers.  I have been informed that this is
not the case, that the documentation failed to mention this feature.
InContext is, as far as I know, unique among SGML editors in that it
doesn't require you to pre-process a document type before you can edit a
document.  In other words, you just edit away.  That's kind of a nifty
feature, especially if you use a lot of ad-hoc document types.

--
\<Address HyTime=bibloc>
W. Eliot Kimber (kimber@passage.com) Systems Analyst and HyTime Consultant
Passage Systems, Inc., 9971 Quail Blvd., Suite 903, Austin TX 78758, +1 512 339 1400
465 Fairchild Dr., Suite 201, Mountain View, CA  94043, +1 415 390 0911
\</Address>
</message>
<message id="<1994May17.083731.16013@leeds.ac.uk>" date="2978152651">
Newsgroups: comp.text.sgml
Date: 17 May 1994 08:37:31 UT
From: Nikos F Drakos \<nikos@cbl.leeds.ac.uk>
Message-ID: <1994May17.083731.16013@leeds.ac.uk>
References: <16FB8107D6S85.UPP201@ibm.rhrz.uni-bonn.de>
Subject: Re: SGML, HTML and LaTeX

[Michael Burschik]

|   There seem to be some tools that convert SGML or HTML documents into
|   LaTeX for printing.  What is the most recommendable package?  Are there
|   any tools for converting LaTeX files into SGML or HTML?

Try http://info.cern.ch/hypertext/WWW/Tools/
for a list of conversion tools to and from HTML.

More information on LaTeX2HTML is available at
http://cbl.leeds.ac.uk/nikos/tex2html/doc/latex2html/latex2html.html

If you have a forms-capable browser (eg Mosaic) you can even try running
LaTeX2HTML remotely at
http://cbl.leeds.ac.uk/~nikos/doc/www94/section3_5.html

Nikos.

--
Nikos Drakos			
Computer Based Learning Unit   	nikos@cbl.leeds.ac.uk
University of Leeds		http://cbl.leeds.ac.uk/nikos/personal.html
</message>
<message id="<2ra5d7$jb7@osiris.wu-wien.ac.at>" date="2978158439">
Newsgroups: comp.text.sgml
Date: 17 May 1994 10:13:59 UT
From: Rony Flatscher \<Rony.Flatscher@wu-wien.ac.at>
Organization: University of Economics and Business Administration
Message-ID: <2ra5d7$jb7@osiris.wu-wien.ac.at>
References: <9405161512.AA22556@source.asset.com> \<Cpws9r.5wA@news.cis.umn.edu>
Subject: Re: SOM

[Len Bullard]

|   Pardon my ignorance.  What is an SOM object?

[R A Milowski]

|   I believe it stands for "Standard Object Model" It is a "defacto
|   standard" developed by IBM for standardize object representation so
|   objects can be shared by applications.  I think.... :)

It stands for "System Object Model" and is basically an object-wrapper
which allows for intermixing programming languages in an
object-style-manner (even non-object languages with object-oriented
languages).

DSOM stands for "distributed system object model" which allows for
intermixing (or just using) objects transparently even over a local area
network and is a CORBA-compliant extension to SOM.  The "client" does not
need to know whether the server-object is locally available or not.  If it
is locally unavailable and it is available someplace in the LAN it will be
used, regardless of the fact whether the server-object runs on the same
operating system!  Hence you could intermix OS/2-, Windows-, Unix- and
Apple-objects transparently.

SOM was developed by IBM and has been accepted as an industry standard with
the notable exception of Microsoft.  OpenDoc (originally developed by
Apple, now in cooperation - I think - with HP, IBM, Novell, Oracle,
WordPerfect, Xerox) uses IBM's DSOM-technology because of its features
(OpenDoc is being positioned as an industry standard which is
operating-system independent and a superset to Microsoft's OLE; of course
Microsoft does not adhere to the OpenDoc-standard at the moment and fights
it).

The nice thing about a possible SOM-SGML-parser, especially in the case of
DSOM is, that any workstation could transparently utilize that parser, no
matter where, no matter how and no matter on which operating system it was
implemented.  The same would be true for SGML-applications.

---rony
</message>
<message id="<2ra6fd$32b@ricrdc2>" date="2978159533">
Newsgroups: comp.text.sgml
Date: 17 May 1994 10:32:13 UT
From: Shigeya Senda \<senda@ipe.rdc.ricoh.co.jp>
Organization: Information\&Communication R\&D Center, RICOH Co., Ltd.
Message-ID: <2ra6fd$32b@ricrdc2>
References: <2r71dj$t1d@ricrdc2> <19940516.2247@naggum.no>
Subject: Re: [Q] question about marked section

[Shigeya Senda]

|   I have a question in marked section grammer.
|
|   At "status keyword specification" in marked section declaration, we can
|   use parameter entity references (and cannot use general entity
|   references) ...
:
|   Why parameter entity?  I feel it is natural to use general entity
|   reference in document instance part.

[Erik Naggum]

|   a parameter entity reference is required because the keyword is a
|   parameter in the marked section declaration.

I understand.  The entity reference definition itself isn't reflected in
document contents, so it must be a parameter.

Thank you for replying.

--
+--------------------------------------------------------------------+
|   RICOH RESEACH AND DEVELOPMENT CENTER                S.SENDA      |
+--------------------------------------------------------------------+
</message>
<message id="<19940517.2277@naggum.no>" date="2978161951">
Newsgroups: comp.text.sgml
Date: 17 May 1994 11:12:31 UT
From: Erik Naggum \<erik@naggum.no>
Organization: Naggum Software; +47 2295 0313
Message-ID: <19940517.2277@naggum.no>
References: <769022215snz@pctrain.demon.co.uk>
Subject: Re: CONREF

[Basil Stephens]

|   Examples I have seen associate the Use of CONREF with the use of no end
|   element, obviously as defined by the .dtd; why is this, please?  I
|   can't find anything in Goldfarb so far (;-)) which requires this!

presuming you mean "no end-tag", the reason is that a CONREF attribute, if
specified, makes the element empty.  if it is not specified, the element
has the element of its content model.  see [408:20] in the SGML Handbook
and the note at [409:1], which, for your convenience, reads:

    "O" should be specified for end-tag minimization if the element has
    a content reference attribute or a declared value of "EMPTY".

    NOTE -- Specifying "O" serves as a reminder that empty elements do
    not have end-tags (although this has nothing to do with markup
    minimization).

obviously, that "declared value" should have been "declared content".

however, I think this is one of the things that should be slashed in the
revision.  the "O" is for "omitted" as in markup minimization, but empty
elements do not have an end-tag they can omit in the first place.  if I
want an explicit end-tag _and_ I want a CONREF attribute, some parsers will
yell at me.  now, I can handle that, since I go write my own parser when I
get sufficiently pissed off by obnoxious warnings, or patch the code so it
doesn't yell, anymore, but other users may get frustrated if their element
type is such that end-tag omission for such elements creates problems, and
they may do unnecessary and cumbersome things to silence the warnings, in
the end thinking that SGML is broken because it doesn't allow them to do
very obviously Right Things.  such is the nature of excessive reminders.

\<e.g.>

    \<!ELEMENT recurse - - (recurse+)>
    \<!ATTLIST recurse halt CDATA #CONREF>

possibly not very useful in this bare-bones version, but this illustrates
that end-tags _cannot_ be omitted unless the recursion is halted with the
"halt" attribute.
\</e.g.>

so my suggestion is: ignore any warnings you may get, and ignore the
"reminder" and any warnings reminding you to insert the reminder.

best regards,
\</Erik>
--
Erik Naggum \<erik@naggum.no> \<SGML@ifi.uio.no>  |  memento, terrigena.
ISO 8879 SGML, ISO 10744 HyTime, ISO 10646 UCS  |  memento, vita brevis.
</message>
<message id="<19940517.2278@naggum.no>" date="2978163781">
Newsgroups: comp.text.sgml
Date: 17 May 1994 11:43:01 UT
From: Erik Naggum \<erik@naggum.no>
Organization: Naggum Software; +47 2295 0313
Message-ID: <19940517.2278@naggum.no>
References: <1994May16.145935.9648@xgml.com> <2r9jqr$6hm@search01.news.aol.com>
Subject: Re: Tool limitations (was nesting entities)

[Eliot Kimber]

|   InContext is, as far as I know, unique among SGML editors in that it
|   doesn't require you to pre-process a document type before you can edit
|   a document.  In other words, you just edit away.  That's kind of a
|   nifty feature, especially if you use a lot of ad-hoc document types.

if an SGML editor uses a pre-processed DTD, that's fine by me, but I get
disappointed, as a programmer, when programmers move the responsibility of
such tasks as synchronization between DTD and pre-processed DTD into the
users' domain.  it baffles me that users are willing to accept such
responsibility and don't just ask the vendor to come back when they have
done a better job.

I find it ironic that so-called "user friendly" programs require the user
to produce the pre-processed DTDs by hand.  such synchronization is hard,
humans are error-prone, and lack of synchronization is likely to cause
damage or at least wasted time.  if the programmer needs it and creates it,
the programmer should be responsible for maintaining it, too.  asking the
user to do it is rude and obnoxious.

I maintain that all this is possible only because users are willing to put
up with too much shit from the software vendors.  I mean, if software
vendors produced knives, they would be all doubled-edge blade, no handle,
and they would market them as major improvements over knives with handles.
does it have to be this way?  no.  but the _users_ have to start asking
serious questions, not just nod and accept whatever the salesman tells
them, and _especially_ not just accept whatever the salesman does _not_
tell them.

I contend that it is the user interface obsession that has caused all this.
software used to be design, algorithms, elegance in solving problems,
heuristics when they were too complex and new languages to express more
complicated ideas.  we weren't perfect, and bugs still persisted, but not
the modern kind of bugs.  machines used to run so long they had to be taken
down for "preemptive maintenance" once every three months or so.  on a PC,
"preemptive maintenance" would mean not turning it on.  hardware has become
incredibly much faster, and software has become incredibly much slower.
the voodoo marketing folks that interpret "market forces" say "users" want
more user friendly software, which is translated into "more colors", "more
menus", "more buttons", "more junk on the user's screen", "more stupid user
interaction (OK, Cancel, Don't care)".  in the hardware world, people can
put together whole computers where no two parts are from the same vendor,
and companies spring up that do just that and their boxes are really cheap,
too.  in the software world, everybody buys everything from single vendors,
and not one component will work with anything else unless from the same
vendor, and not even then.  why?  are the users just plain _nuts_, or is
there a deeper explanation?

well, I don't know, but you can at least start by demanding SGML editors
that don't need those confounded pre-processed document types that makes it
very hard for you to insert a new entity declaration without doing another
stupid pre-processing pass.  it used to be a design goal with SGML that you
should be able to declare all components of a document before you used
them, but since these editors don't allow you to do that (easily), you go
use attribute values that gets interpreted as file names, instead.  argh!

more power to InContext and tools that follow its lead!

personally, I use GNU Emacs for all my text editing needs.  if I want
something that isn't in it, I can write it myself, and the language offered
by GNU Emacs for this purpose is _very_ powerful.  I think all computer
users should be programmers and all tools should have very powerful
programming languages suited to their needs.  (this also explains why I
don't have a car.)

best regards,
\</Erik>
--
Erik Naggum \<erik@naggum.no> \<SGML@ifi.uio.no>  |  memento, terrigena.
ISO 8879 SGML, ISO 10744 HyTime, ISO 10646 UCS  |  memento, vita brevis.
</message>
<message id="<Cpy5s7.1ww@unx.sas.com>" date="2978167110">
Newsgroups: comp.text.sgml
Date: 17 May 1994 12:38:30 UT
From: Mark E. Britt \<sasmsb@unx.sas.com>
Organization: SAS Institute Inc.
Message-ID: \<Cpy5s7.1ww@unx.sas.com>
Subject: Defining a Display in HTML

I'm trying to convert some interactive tutorials to HTML, and there is a
key design feature of the tutorials for which I cannot find a corresponding
HTML element.

The design of the tutorials is based on the premise that the learner should
see only certain information on the display, and to get more information,
s/he must press the \<Continue> button (or to go back to a previous display,
the \<Back> button).  In many cases, the tutorial "builds" on information
already displayed, as shown in the example below.

Therefore, in HTML I need to designate the display as a container so that
only the data contained within the display appears in the viewer/browser
(see the example below).  I've only just started investigating HTML, and so
far I've seen no element that handles this effect.  It appears that all
HTML documents are designed with the expectation that readers will scroll
forward and backward to view information.

Can this be done in HTML?  Is this an issue that's been brought up before?
Is it something that could be addressed with HTML+?  (I know that I could
make each display a separate document, but considering how many documents I
would then have to manage, this is out of the question.  Also, I know I can
link to each new "display" within a document, but then I have no control
over what the learner sees at any time).

Thanks for any information.

\<!-- ###################### EXAMPLE ########################### -->
\<!--      DISPLAY                      CODE                     -->

+----------------------------+         \<HTML>
|                            |         \<head>
|  In this section of the    |         \<title>Tutorial\</title>
|  tutorial, you learn how   |         \</head>
|  to:                       |         \<body>
|                            |         \<display>
|  > blah blah blah          |         \<p>
|                            |         In this section of the
|                            |         tutorial, you learn how
|   \<Continue>  \<Back>       |         to:
+----------------------------+         \<ul>
+----------------------------+         \<li>blah blah blah
|                            |         \</ul>
|  In this section of the    |         \</display>
|  tutorial, you learn how   |         \<display>
|  to:                       |         \<p>
|                            |         In this section of the
|  > blah blah blah          |         tutorial, you learn how
|  > blee blee blee          |         to:
|                            |         \<ul>
|   \<Continue>  \<Back>       |         \<li>blah blah blah
+----------------------------+         \<li>blee blee blee
+----------------------------+         \</ul>
|                            |         \</display>
|  In this section of the    |         \<display>
|  tutorial, you learn how   |         \<p>
|  to:                       |         In this section of the
|                            |         tutorial, you learn how
|  > blah blah blah          |         to:
|  > blee blee blee          |         \<ul>
|  > blow blow blow          |         \<li>blah blah blah
|                            |         \<li>blee blee blee
|   \<Continue>  \<Back>       |         \<li>blow blow blow
+----------------------------+         \</ul>
                                       \</display>
                                       \</body>
                                       \</HTML>

\<!-- #################### END OF EXAMPLE ######################## -->



--
Mark E. Britt | sasmsb@unx.sas.com | The lust for comfort murders the
CBT Developer | (919)677-8000x6497 | passion of the soul, and then
SAS Institute | Opinions are mine. | walks grinning in the funeral.
                                     - Kahlil Gibran, "The Prophet"
</message>
<message id="<19940517.2281@naggum.no>" date="2978170100">
Newsgroups: comp.text.sgml
Date: 17 May 1994 13:28:20 UT
From: Erik Naggum \<erik@naggum.no>
Organization: Naggum Software; +47 2295 0313
Message-ID: <19940517.2281@naggum.no>
References: <9405161512.AA22556@source.asset.com>
Subject: Re: SOM

[Len Bullard]

|   Pardon my ignorance.  What is an SOM object?

I sent this to Len directly, but it seems others need the information, too.

you can find more information about SOM in ftp.cil.org:/pub/SOM-interest.
SOM stands for System Object Model.  I attach the file SOM-brief.txt from
the above-mentioned archive (slightly edited).

best regards,
\</Erik>
---------------------------------------------------------------------------


Making Reuse a Reality --
Introducing the SOMobjects Developer Toolkit

Object oriented (OO) technology represents a fundamental change in the
concept of software development.  The promise of OO technology is for
software objects -- segments of code combining data and procedures -- to
become standard, off-the-shelf parts.  A programmer can then incorporate
these objects in a software system as easily as an engineer incorporates an
off-the-shelf semiconductor chip in a circuit board design.


The Future is Now

Creating interchangeable, reusable software components is very much a
reality today: you can choose from a variety of object oriented programming
languages, design and analysis aids, and visual programming tools.  You can
also find OO enablers and frameworks built into the latest releases of some
operating systems, including IBM Operating System/2(R) [OS/2(R)] Versions
2.0 and 2.1.  What you haven't been able to find -- until now -- are some
key pieces needed to complete a system that works as smoothly and openly as
the hardware development process.

Now, SOMobjects(TM) tools fill in some of these missing pieces by breaking
objects free from ties to a specific programming language.  On a single
machine, the System Object Model (SOM) provides an object-structured
protocol that allows applications to access and use objects and object
definitions, regardless of what programming language created them, with no
need to recompile the application.  SOM's language-neutral character not
only allows robust software objects to be easily used and reused wherever
they're needed, it also enables a greater degree of openness than ever
before in the development and use of object oriented programming (OOP)
facilities across multiple operating platforms.

What's more, SOMobjects incorporates Distributed SOM (DSOM) technology that
provides a base for OOP development and use over entire networks.  With the
IBM SOMobjects Developer Toolkit Version 2.0, programmers can start taking
advantage of SOM and DSOM immediately.	SOMobjects is available for both
the OS/2 2.0 (or higher) and the IBM AIX/6000(TM) 3.2 (or higher) operating
systems and is planned for the Windows environment.  Complying with
industrywide standards of the Object Management Group's [OMG(R)] Common
Object Request Broker Architecture (CORBA), SOMobjects provides an
extensive set of facilities for putting the power and speed of OO
technology to work for you.


Highlights of SOMobjects

IBM's SOMobjects gives professional programmers a comprehensive toolkit for
creating OO class libraries and instances of those classes -- that is,
objects that belong to particular classes -- using SOM and DSOM technology.
Among the product's powerful functions and benefits are the following:

>  Language-neutral support for the definition, construction and use of OO
   systems -- by defining an application programming interface (API) to SOM
   objects based on simple procedure calls -- allows applications written
   in different programming languages to use a common class library.

>  Upward binary compatibility of SOM class libraries, as dynamic link
   libraries (DLLs) that can be replaced without requiring recompilation of
   application source code, makes it easy for end users to install new
   releases of system libraries.

>  SOM/IDL Compiler is key to language neutrality and supports the OMG
   standard Interface Definition Language (IDL).  IDL gives individual
   object implementations the means to tell potential clients what
   operations are available and how they should be invoked.  By mapping the
   public and private types for a given SOM class to language-specific
   bindings, the compiler allows SOM objects to be used by different
   programming languages.

>  Distributed SOM supports OMG's CORBA standards: -- Workstation DSOM
   provides support for cross-process and cross-address transparent access
   to objects in a single-system environment.  -- Workgroup DSOM enables
   you to define or access objects remotely in a distributed workgroup
   local area network (LAN) configuration across supported platforms (e.g.,
   OS/2 to AIX/6000).

>  Language bindings for C and C++ allow programmers in those languages to
   use SOM objects and create new SOM object classes with ease.	 Other SOM
   bindings may be available from language vendors.

>  Replication Framework makes copies of a single object available
   concurrently to multiple clients, and maintains consistency among the
   copies, with updates to any one copy automatically reflected in all
   other copies.  It's especially useful for collaborative applications
   when you need updated information instantly.	 Fault tolerance is   provided to ensure object integrity in the event of a node failure.

>  Persistence Framework allows you to save and restore SOM objects to and
   from a repository that can be a file system, database, or object
   database.  You can also change to another format [e.g., Lotus® to
   Excel®].

>  Emitter Framework that produces an output file representing part or all
   of an object interface definition -- structured to minimize effort,
   improve quality, and simplify maintenance -- makes it easy to develop
   additional language bindings for SOM.

>  Collection Classes Framework gives programmers such frequently needed
   data structures as lists, sets, queues, and dictionaries, and lets them
   inherit from and use these SOM classes in applications, with no need to
   recode or retest the functions.

>  Workstation Enabler, for OS/2 or AIX/6000, enables the execution of
   SOM-based applications in a single-machine environment.

>  Workgroup Enabler, for OS/2 or AIX/6000, enables the execution of
   SOM-based applications across a multiple-node workgroup LAN distributed
   environment.


What You Get

IBM's SOMobjects offers you several packages for the development and
execution of SOM-based applications in OS/2 and AIX/6000 environments.
Each package is available separately.

>  SOMobjects Developer Toolkit Version 2.0 includes 3.5-inch diskettes
   containing the basic SOM kernel (SOM/IDL Compiler and language bindings
   for C and C++) and all of the frameworks, including Distributed SOM
   (DSOM). SOMobjects documentation is provided in hardcopy with the
   SOMobjects Developer Toolkit, along with the product license agreement.
		[Part number 96F8647 for OS/2; part number 96F8648 for AIX]

>  SOMobjects Workstation Enabler Version 2.0, for SOMobjects applications
   operating on a single machine, includes the following: -- SOM kernel
   runtime -- DSOM runtime -- Persistence runtime -- Replication runtime --
   Collection Classes runtime -- 3.5-inch diskettes containing the
   runtimes, as well as a hardcopy installation/configuration guide and the
   product license agreement.
				       [96F8673 for OS/2;  96F8674 for AIX]

>  SOMobjects Workgroup Enabler Version 2.0 includes 3.5-inch diskettes
   containing all of the runtimes listed above for the Workstation Enabler,
   but geared for SOMobjects applications operating across a multi-node
   LAN.	 A hardcopy installation/configuration guide and the product
   license agreement are also included.
					[96F8675 for OS/2; 96F8676 for AIX]

>  SOMobjects Documentation is included with the SOMobjects Developer
   Toolkit.  This set of hardcopy publications is also available
   separately, as the SOMobjects Developer Toolkit Publications Version 2.0
   package.
						      [Part number 96F8649]

Ordering and Training Information

>  U.S.	      +1 800 342 6672		  >  Canada	+1 800 465 7999
   Training   +1 800 426 8322		     Training	+1 800 661 2131
--
Erik Naggum \<erik@naggum.no> \<SGML@ifi.uio.no>  |  memento, terrigena.
ISO 8879 SGML, ISO 10744 HyTime, ISO 10646 UCS  |  memento, vita brevis.
</message>
<message id="<1994May17.140242.27932@tin.monsanto.com>" date="2978172162">
Newsgroups: comp.text.sgml,comp.text,comp.lang.misc,comp.lang.icon
Date: 17 May 1994 14:02:42 UT
From: Joel Finkle \<jjfink@skcla.monsanto.com>
Organization: Searle R\&D
Message-ID: <1994May17.140242.27932@tin.monsanto.com>
Subject: Spitbol reference?

I'm looking for reference material on the Spitbol language.  Our vendor is
using it to write converters into our SGML DTD from files parsed out of
WordPerfect, and I'd like to be able to make changes without having to wait
for them, sometimes.

I'd greatly appreciate hearing about any on-line materials specifying the
language, or even (gasp) ones I'd have to pay for.

Thanks in advance,

Joel Finkle
--
Joel Finkle
Searle R\&D
jjfink@skcla.monsanto.com

"And when I die don't bury me / in a box in a cemetery.
Out in the garden would be much better,
I could be pushin' up home grown tomatoes" -- Guy Clark, "Home Grown Tomatoes"
</message>
<message id="<D90-PIM.94May17161035@dront.nada.kth.se>" date="2978172634">
Newsgroups: comp.text.sgml
Date: 17 May 1994 14:10:34 UT
From: Pietro Mancino \<d90-pim@dront.nada.kth.se>
Organization: The Royal Institute of Technology
Message-ID: \<D90-PIM.94May17161035@dront.nada.kth.se>
Subject: Open Document Architecture \<ODA> is alive !!

At the end of March I sent a news in which I asked for help on my master's
thesis about ODA.  Here is also part of the answer I got from Erik Naggum
\<erik@naggum.no>.  So far, after approximately one month of research,
information gathering, contacts research, etc.  I can contradict more less
all the negative remarks that Erik Naggum made about ODA.

[Erik Naggum]

|   I won't say ODA is dead, but the only place it is still alive is in
|   ISO/IEC JTC 1/SC 18/WG 3 and in the corresponding ITU-TS study group.
|   they produce new parts and technical corrigenda so fast that if there
|   were any actual implementations, they could not conform to the
|   standard.

There is a well defined and no subject to alteration edition since December
1988.  The ODA (Open Document Architecture) is documented as ISO 8613 by
the ISO, as the T410 Series of recommendations by CCITT and as ECMA 101 by
the European Computer Manufacturers Association.  By the way, you can get a
free copy writing to ECMA rue du Rhone 114 ch-1204 Geneve Switzerland (an
excellent service).  Moreover this standard tries help users.  It means
that instead of wasting your time through a lot of paper, you can use a ODA
TOOLKIT that facilitate the creation and manipulation of documents
conforming to the ODA standard.  Application developer do not have to
struggle with the exact ODA specifications, as this is taken care of by the
toolkit.  It has been developed by the ODA Consortium (I'll speak about
that later).

|   there are no public domain ODA systems, no publicly available "kernels"
|   or parser kits that help you build applications on top of it, no
|   newsgroup for it, no FTP sites, no nothing.

False!  You can get a lot of information and references through Internet
using different kinds of information services.  There an FTP server
(cs.ucl.ac.uk under the directory ietf-osi-oda) where you can find ODA
documents.  You can find references to it in different place among them
through a lot of FAQ such as comp.protocols.iso.X400, comp.protocols.iso,
and others. 

If you try to use VERONICA access using the keyword ODA you get a lot of
links everywhere in the world about the ODA issue.  Among others you can
get the reference to ISO International Standardised Profiles (ISPs) that
are profiles for ODA have been agreed worl-wide (see FOD11, FOD26, FOD36).
There is the ODA Consortium in which six of the world's major computer
companies joined together in order to promote the ODA standard and to
provide the means to implement it in software applications.  The companies
are: Bull SA, DEC, IBM, ICL PLC, Siemens Nixdorf Inform.  SNI and Unisys.
Last January also WordPerfect has joined the ODA Consortium.  Bye the way
WordPerfect Corporation is sponsoring with the ODA Consortium the "ODA
Connections '94 Conference" which will take place on June 20-21 in Brussels
(Belgium).  For further information and the registration form contact:

    ODA Consortium
    Avenue Marcel Thiry 204
    1200 Brussels, Belgium
    Phone + 32 2 774 9623
    Fax + 32 2 774 9690

The goal of the conference is to give the opportunity to explore the ODA
standard and find out what ODA can do.  I suggest to write also to ODA
Consortium where you can find a lot of materials about the toolkit release
2 (Introduction manual, license Agreement, etc.) and you can get also the
ODA brochure, papers from the press information, etc.

|   ODA's third forte would have been in the ability to interchange
|   structured documents.  however, ODA defines structures in the standard
|   committees, and they get so big and still so useless that people can
|   only use them for very well-defined tasks, such as office documents,
|   which was what ODA was about before the committee was misled by the
|   promiscuity of the word "open"

ODA's forte would have been not only the ability to blindly interchange
documents but also be a single self-contained standard under which
documents, including both text and graphics, can be transmitted with all
attributes intact from one system to another for further editing,
processing, storing, printing and retransmission.  I do not think that
facsimile standards and Postscript provide the possibilities mentioned
above (unless the entire document is re-keyed wrt facsimile). There are a
lot of products dealing with these issues. (see the ODA brochure there an
ODA Software Catalogue with products from a lot of companies such as Bull,
Apple, ICL, Xerox, etc.).

|   there were a lot of activity around ODA back in the late 80's, and I'm
|   sure there were a number of articles written on it then.  you're not
|   the only student to have been asked to plod down this unused path, so
|   there should be a fair amount of theses available on it, as well.  but
|   if it is success as a concept, and as an application you're looking
|   for, and any future use of your knowledge in a standard, you should ask
|   you advisor if this topic is in your best interest.

Instead I think this is a very interesting topic also in the extend of
applicability.  There are a lot of activities, researchs, applications and
projects going on about ODA.  Some references can be found above.  For more
information you can contact also Mr Leon Surleau at Bull (telephone + 33 1
6993 80 86) or ODA Consortium.

Everybody interested can mail me requests.  I can give more information
about the references used in this news and more.

Regards,
Piero.
--
                                   E R I C S S O N ///
Piero Mancino
ST/ETX/TX/FD                       Tel : + 46 8 719 6004
Ericsson Telecom AB                Fax : + 46 8 719 3055
126 25 Stockholm                   e-mail: piero@stdoca.ericsson.se
Sweden                             Memo: ERI.EXTR.XKKPIMA
</message>
<message id="<2r9ugb$93f@felix.dircon.co.uk>" date="2978175628">
Newsgroups: comp.text.sgml
Date: 17 May 1994 15:00:28 UT
From: Bruce Hunter \<bruce@sgml.dircon.co.uk>
Message-ID: <2r9ugb$93f@felix.dircon.co.uk>
References: \<mbroschi.7.000EF6AD@email.state.ut.us> <2r9fk8$27s@usenet.rpi.edu>
Subject: Re: HTML primer: where to obtain?

[Todd G. Hivnor]

|   Somewhere out on the net is a .FFF <--> SGML convertor, which runs on
|   DOS.  That still leaves some work for you, but its just SGML <--> SGML,
|   so it shouldn't be bad.  I found it a while ago, but don't remember
|   where it was.  If you can't find it drop me some mail & check around my
|   office.

These tools are at at least two sites that I know of.  They are contained
in a file called fv3sgml.zip, and are at ftp.ifi.uio.no in /pub/SGML/Demo
and at sgml1.ex.ac.uk in /fv3sgml.  I have since produced an updated
version of the program fff2sgml.exe, which improves on the (rather basic)
error handling in the original release.  If anyone wants this let me know
and I'll mail it to you.  I'll also try to add it to the ftp sites.

Also in beta test at present is a generalised sgml to .fff transformation
product.  It is written using Balise, and uses an SGML file to specify the
transformation to be performed in a manner analagous to a FOSI (though the
syntax is different).  This allows you to import SGML files conforming to
arbitrary DTDs into Folio Views, though you obviously have to do the work
of specifying how the structures in the SGML file map into the structures
expected by Views.

best wishes,
--
	Bruce Hunter
	SGML Systems Engineering
	bruce@sgml.dircon.co.uk
</message>
<message id="<CpyDI0.M9M@world.std.com>" date="2978177112">
Newsgroups: comp.text.sgml,comp.text,comp.lang.misc,comp.lang.icon
Date: 17 May 1994 15:25:12 UT
From: John Smith \<sgmlsh@world.std.com>
Organization: The World Public Access UNIX, Brookline, MA
Message-ID: \<CpyDI0.M9M@world.std.com>
References: <1994May17.140242.27932@tin.monsanto.com>
Subject: Re: Spitbol reference?

[Joel Finkle]

|   I'm looking for reference material on the Spitbol language.

The vendor for Spitbol is:

    Catspaw Inc.
    PO Box 1123
    Salida, CO 81201 USA
    1-719-539-3884

    cats!support@cs.arizona.edu

--
\<!--
  "To imagine a language is to imagine a form
   of life."                   - Wittgenstein,
                 Philosophical Investigations
</message>
<message id="<2raob4$2ri@dancer.cc.bellcore.com>" date="2978177828">
Newsgroups: comp.text.sgml
Date: 17 May 1994 15:37:08 UT
From: Haim Kilov \<haim@dancer.cc.bellcore.com>
Organization: Bell Communications Research (Bellcore)
Message-ID: <2raob4$2ri@dancer.cc.bellcore.com>
References: <24916@dog.cme.nist.gov> <19940515.2234@naggum.no>
Subject: Re: nesting entities

[Erik Naggum]

|   the attitude you present is the _old_ attitude -- where software
|   vendors and programmers owned the users' data, and thought they had
|   every right to dictate the terms of its use.  we want to get away from
|   that, and instead have the vendors and programmers understand that they
|   are nothing without the data we ask them to process.  it is _our_ data,
|   not theirs.

Exactly!  E.W.Dijkstra in his 1976 book "A discipline of programming" (and
perhaps even earlier than that) stated that at the beginning of programming
the goal of our programs was to instruct our machines, but "now" we
understood that the goal of our machines is to execute our programs.

-Haim Kilov
haim@cc.bellcore.com
</message>
<message id="<Chameleon.4.00.940517103509.mwalter@ABU.ziff.com>" date="2978183013">
Newsgroups: comp.text.sgml
Date: 17 May 1994 17:03:33 UT
From: Mark Walter \<mwalter@zcias1.ziff.com>
Message-ID: \<Chameleon.4.00.940517103509.mwalter@ABU.ziff.com>
Subject: SGML at airlines

Does anyone know the whereabouts of articles, speeches or more info in
general about SGML projects at commercial airlines--Delta, United, BA,
Northwest, etc.?

--
Mark Walter		| \<mwalter@seyboldpub.ziff.com>
Seybold Publications	| PO Box 644, Media, PA
			| +1 215 565 2480; fax +1 215 565 4659
</message>
<message id="<2ratto$k31@fuhainf.fernuni-hagen.de>" date="2978183544">
Newsgroups: comp.text.sgml
Date: 17 May 1994 17:12:24 UT
From: Michael Schopen \<michael.schopen@Fernuni-Hagen.DE>
Organization: FernUniversitaet Hagen
Message-ID: <2ratto$k31@fuhainf.fernuni-hagen.de>
Subject: AuthorEditor 3.0

Who has got experience with Author/Editor 3.0 for MS-WINDOWS?  I have two
problems:

1. Font size on printer

When I load a file into Author/Editor 3.0, formatting is correct on the
screen, but on the printer (HP LaserJet III) the font sizes are wrong.  An
8 pt font is printed as a 10 pt font, a 10 pt font as a 12 pt font and so
on.  Does anybody know how to handle that?

2. Form feeds on printer

Is it possible to disable form feeds for certain elements, so that an
element is printed on the next side if it does not fit onto the previous
one?  I discovered an option "page-break" in the ASCII-styles, which is not
listed in the documentation and cannot be toggled by one of the VIEW menus.
If I toggle it in the ASCII-styles and reload that file, however, nothing
changes.  Has anybody got a solution?

Thanks for all replys
Michael
</message>
<message id="<9405172030.AA37387@source.asset.com>" date="2978195457">
Newsgroups: comp.text.sgml
Date: 17 May 1994 20:30:57 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405172030.AA37387@source.asset.com>
References: <1994May16.145935.9648@xgml.com> <2r9jqr$6hm@search01.news.aol.com> <19940517.2278@naggum.no>
Subject: Re: Tool Limitations

Erik, its good to be powerful and knowledgeable and Lord Knows I also found
the GUIs to be a fine way to slow down a power user, somewhat similar to
QWERTY/DVORAK keyboard controversy.  However, lovers of the harpsichord
stood by it until that upstart pianoforte thing proved it could stay in
tune longer.  All learning is not progress and vice versa, but they are all
some form of evolution.  I have a friend who can read binary like it was
english, and I never hope to have his skill: just his occasional help.  But
as the days grow long, the babies get taller, and my mind weakens with
advancing age, I'm often grateful for the guys like you and Eliot who dig
deep, build wonderful compositions of bytes and longs, and save me a little
more time to be at home with the little ones.

Cheers,

Len
</message>
<message id="<1994May17.212253.12117@tin.monsanto.com>" date="2978198573">
Newsgroups: comp.text.sgml,comp.text,comp.lang.misc,comp.lang.icon
Date: 17 May 1994 21:22:53 UT
From: Joel Finkle \<jjfink@skcla.monsanto.com>
Organization: Searle R\&D
Message-ID: <1994May17.212253.12117@tin.monsanto.com>
References: <1994May17.140242.27932@tin.monsanto.com>
Subject: Re: Spitbol reference?

[Joel Finkle]

|   I'm looking for reference material on the Spitbol language.  Our vendor
|   is using it to write converters into our SGML DTD from files parsed out
|   of WordPerfect, and I'd like to be able to make changes without having
|   to wait for them, sometimes.

Boy, that was quick.

I've already had four people tell me SPITBOL is pretty much just compiled
SNOBOL, two of whom are now trying to convince me that I'm an idiot to
continue development with it, and are probably going to lead me down the
road to ICON.

I've found a PD SNOBOL4, with at least enough docs that I can understand
our SPITBOL code, so I'm off and running.

Dontcha just love the 'net?

Joel
--
Joel Finkle
Searle R\&D
jjfink@skcla.monsanto.com

"And when I die don't bury me / in a box in a cemetery.
Out in the garden would be much better,
I could be pushin' up home grown tomatoes" -- Guy Clark, "Home Grown Tomatoes"
</message>
<message id="<2rco77$5t2@newsy.ifm.liu.se>" date="2978243239">
Newsgroups: comp.text.sgml,comp.text
Date: 18 May 1994 09:47:19 UT
From: Lars Aronsson \<aronsson@lysator.liu.se>
Message-ID: <2rco77$5t2@newsy.ifm.liu.se>
References: <94136.124700U35395@uicvm.uic.edu> <1994May16.214829.198@ousrvr.oulu.fi>
Subject: Re: TEI guidelines published

[Janne Himanka]

|   The files are browsable via WWW and searchable via WAIS at
|   http://www.oulu.fi/TEI.html

|   This is just a quick hack I put together to aid myself in perusing

For the same reason, one month ago, I converted the old TEI P2 (the new
version is TEI P3) filelist to HTML.  For some inspiration, look at

    http://www.lysator.liu.se:7500/runeberg/teifiles.html

Doing this conversion takes some time, but less than a day.  I wish the
filelist would already be encoded in SGML, but you can't always get what
you want...

--
Lars Aronsson, Lysator, Linkoping University, Linkoping, Sweden
</message>
<message id="<9405181304.AA35323@source.asset.com>" date="2978255054">
Newsgroups: comp.text.sgml
Date: 18 May 1994 13:04:14 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405181304.AA35323@source.asset.com>
References: \<Robbin.10.0011D180@[140.138.150.213]>
Subject: Re: Are there any HyTime DTD or DI sample?

Robbin:

As HyTime prescribes architectural forms for creating components of DTDs,
there are DTDs that rely on HyTime, but no DTDs that are HyTime as such.
The folks at TechnoTeacher Inc. have written and published several samples
that illustrate how HyTime forms can be used for various representation
tasks.  They are regular readers of this net so they will probably reply to
your request.  If you find more, please make them available to the rest of
us if possible.  The one thing besides more software that will advance the
cause of using Hytime would be more examples that clearly illustrate how to
use the features, and how the features relate to different windowing
environments (X, MsWindows, etc.).  The standard (10744) is written to be a
legal document; therefore, it is definitionally tight, but does not contain
sufficient examples, which, of course, is not its job.  The same was true
of the original 8879 which was also hard to decipher.  Both Eliot Kimber of
Passage Systems and Steve de Rose of EBT are publishing books soon on the
subject.  I have seen parts of both and am sure these works will be very
instructive.  Meanwhile, the First International Conference on Hytime in
Vancouver this summer should be an excellent place to get more information
and become acquainted with the cognoscenti.

An interesting question that arises when discussing HyTime is what is
"meta" about it.  That is, what forms can be altered, and which must be
used "as quoted".  It was explained to me that the element type forms can
be modified, but that the attribute forms were used "as is".  Perhaps
Eliot, Erik, or someone else can shed more light on this matter.  The use
of meta-terms is something new in the SGML practice, and articles that
describe the practical approach will be most helpful.

Len Bullard
</message>
<message id="<1994May18.142127.21845@sq.sq.com>" date="2978259687">
Newsgroups: comp.text.sgml
Date: 18 May 1994 14:21:27 UT
From: Donald Teed \<donald@sq.sq.com>
Organization: SoftQuad Inc., Toronto, Canada
Message-ID: <1994May18.142127.21845@sq.sq.com>
References: <50@markj.win.net> <2r9jjc$6g5@search01.news.aol.com>
Subject: Re: Author/Editor and 12083 DTDs

[Mark T. Jacobson]

|   I keep getting an error message indicating that points to the end of
|   the DTD indicating that an unexpected character was found and that < is
|   expected.  Can someone help me figure how to get this DTD to compile?

[Eliot Kimber]

|   This error can be caused by having a ctrl-Z character at the end of the
|   document, which for some reason, mkrls refuses to treat as an EOF
|   character and instead, realizing that ctrl-Z is not declared in the
|   SGML declaration, complains that it has found a bad character, chokes
|   and dies.  Poor form.
|
|   Try editing the DTD with Windows Write or the DOS editor and save it.
|   Under OS/2, if you edit DTDs with the Enhanced Editor, you need to save
|   with the /ne option (save /ne from the command dialog).  Took me a
|   while to figure that one out.

Author/Editor and the MKS Toolkit vi editor will also save a file without
appending a ^Z character to the end of file.

SoftQuad will soon be releasing new versions of Rules Builder for Windows,
Motif and Macintosh platforms.  Aside from now having a new, interactive,
GUI interface, many bugs have been fixed, including the problem with ^Z
characters at the end of some MSDOS/Windows files not being recognized as
end of file.

--Donald Teed, Technical Support, SoftQuad
</message>
<message id="<2rd8bm$bte@newsy.ifm.liu.se>" date="2978259766">
Newsgroups: comp.text.sgml,comp.text,comp.infosystems.www
Date: 18 May 1994 14:22:46 UT
From: Lars Aronsson \<aronsson@lysator.liu.se>
Message-ID: <2rd8bm$bte@newsy.ifm.liu.se>
References: <94136.124700U35395@uicvm.uic.edu> <1994May16.214829.198@ousrvr.oulu.fi> <2rco77$5t2@newsy.ifm.liu.se>
Subject: Re: TEI guidelines via FTP, WAIS, WWW

I have now converted the new TEI P3 Filelist to HTML.  See URL below.  You
might find this easier to read than Janne's quick version.  The hypertext
references are pointing to Erik Naggum's FTP archive.  I don't run WAIS, so
I keep a pointer to Janne's WWW server for that.

Text Encoding Initiative (TEI) P3 released on May 16, 1994, now on WWW
  ftp://ftp.ifi.uio.no/pub/SGML/TEI                     -- Erik Naggum
  http://www.oulu.fi/TEI.html                         -- Janne Himanka
  http://www.lysator.liu.se/runeberg/teip3files.html  -- Lars Aronsson
--
Lars Aronsson, Lysator, Linkoping University, Linkoping, Sweden
</message>
<message id="<JOHANUDD.94May18095250@othello.admin.kth.se>" date="2978261570">
Newsgroups: comp.text.sgml
Date: 18 May 1994 14:52:50 UT
From: Johan Udd \<johanudd@admin.kth.se>
Organization: Royal Institute of Technology, Sweden
Message-ID: \<JOHANUDD.94May18095250@othello.admin.kth.se>
References: \<D90-PIM.94May17161035@dront.nada.kth.se>
Subject: Re: ODA (Open Document Architecture) is alive !!

The ODA Consortiunm is sponsoring with WordPerfect Corporation the "ODA
Connections '94 Conference" that will take place on June 20-21 in Brussels,
Belgium.

The aim of the conference is to give participants the opportunity to
explore the Open Document Architecture standard (the international standard
for exchanging compound documents containing text, graphics, and other
information), and discover what ODA can do for them.  The first day there
will be a training seminar to review the complete ODA standard.  the second
day a developer's training session will show how to use the ODA Consortium
toolkit.  You may choose to attend either the first day's overview and
introduction of ODA or the second day's technical application conference,
or you may attend both days.

For further information, brochure and registration form contact:

    ODA Consortium
    Avenue Marcel Thiry 204
    1200 Brussels, Belgium
    Phone + 32 2 774 9623
    Fax   + 32 2 774 9690

Regards.
Johan
</message>
<message id="<9405181520.AA24176@wpdis03.wpafb.af.mil>" date="2978263210">
Newsgroups: comp.text.sgml
Date: 18 May 1994 15:20:10 UT
From: "Richard G. Littleton" \<rlittlet@wpdis03.wpafb.af.mil>
Message-ID: <9405181520.AA24176@wpdis03.wpafb.af.mil>
References: \<Chameleon.4.00.940517103509.mwalter@ABU.ziff.com>
Subject: Re: SGML at airlines

[Mark Walter]

|   Does anyone know the whereabouts of articles, speeches or more info in
|   general about SGML projects at commercial airlines--Delta, United, BA,
|   Northwest, etc.?

Don't know if the individual airlines are doing anything, but the Air
Transport Association of America was working on DTD(s) to accompany ATA
Specification No. 100.  This is a commercial spec that is used by most
aircraft manufacturers for building their maintenance manuals, which then
go to the airlines when they buy a plane.  ATA's adrress is

    Air Transport Association of America
    1709 New York Avenue NW
    Washington, DC  20006-5206
    Phone: +1 202 626 4000

My copy of ATA 100 is a couple of years old so that might not be the
correct address or phone, but it should at least get you in the ballpark.
You might also want to check with the Aerospace Industries Association.
Most aircraft manufacturers are memebers of AIA.  Their address is:

    Aerospace Industries Association
    1250 Eye Street NW
    washington, DC  20005
    Phone: +1 202 371 8460
    Donald Calvert, Director of Standardization

Hope some of this helps.
</message>
<message id="<DMEGGINS.94May18122619@aix1.uottawa.ca>" date="2978267179">
Newsgroups: comp.text.sgml
Date: 18 May 1994 16:26:19 UT
From: David Megginson \<dmeggins@aix1.uottawa.ca>
Organization: Department of English, University of Ottawa
Message-ID: \<DMEGGINS.94May18122619@aix1.uottawa.ca>
References: \<Cpy5s7.1ww@unx.sas.com>
Subject: Re: Defining a Display in HTML

[Mark Britt]

|   The design of the tutorials is based on the premise that the learner
|   should see only certain information on the display, and to get more
|   information, s/he must press the \<Continue> button (or to go back to a
|   previous display, the \<Back> button).  In many cases, the tutorial
|   "builds" on information already displayed, as shown in the example
|   below.

To paraphrase Spock's comment on Kahn in one of the Star Trek movies, you
are showing signs of one-dimensional thinking.  While it is possible to
scroll a hypertext document, it is probably better to move to a new
document for new information (personally, I like HTML documents which fit
on a screen).  Try something like this:

  \<HTML>
  \<HEAD>
  \<TITLE>Tutorial\</TITLE>
  \</HEAD>
  \<BODY>
  \<H1>Fooing the Bar\</H1>

  \<IMG SRC="bar.gif">\<P>

  It can be somewhat tricky to foo a bar.  Try fooing this bar, then
  click \<A HREF="barred-foo.html">here\</A> when you are ready.\<P>

  \<ADDR>Mark Britt\</ADDR>
  \</BODY>
  \</HTML>

Hope this helps,

David
--
David Megginson                Department of English, University of Ottawa,
dmeggins@aix1.uottawa.ca       Ottawa, Ontario, CANADA  K1N 6N5
dmeggins@acadvm1.uottawa.ca    Phone: (613) 564-6850 (Office)
ak117@freenet.carleton.ca             (613) 564-9175 (FAX)
</message>
<message id="<Robbin.10.0011D180@[140.138.150.213]>" date="2978272138">
Newsgroups: comp.text.sgml
Date: 18 May 1994 17:48:58 UT
From: Robbin Wong \<robbin@csnext.yzit.edu.tw>
Organization: Dep. Computer Sci. & Information Eng., Chiao Tung Univ., Taiwan, R.O.C
Message-ID: \<Robbin.10.0011D180@[140.138.150.213]>
Subject: Are there any HyTime DTD or DI sample?

I am studing about HyTime standard,
and where can I get HyTime DTD and DI
sample?

Robbin Wong in Yan-ze institute, Chinese Taiwan
Mail addr: robbin@csnext.yzit.edu.tw
</message>
<message id="<9405181849.AA58090@source.asset.com>" date="2978275744">
Newsgroups: comp.text.sgml
Date: 18 May 1994 18:49:04 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405181849.AA58090@source.asset.com>
References: \<Cpy5s7.1ww@unx.sas.com> \<DMEGGINS.94May18122619@aix1.uottawa.ca>
Subject: Re: Defining a Display in HTML

[David Megginson]

|   To paraphrase Spock's comment on Kahn in one of the Star Trek movies,
|   you are showing signs of one-dimensional thinking.  While it is
|   possible to scroll a hypertext document, it is probably better to move
|   to a new document for new information (personally, I like HTML
|   documents which fit on a screen).  Try something like this:

It is also possible that HTML is the wrong application for creating
tutorials.  Am I right in assuming that the use of HTML assumes the use of
an HTML DTD which fixes the data tagging of the instance?  If so, then the
chief advantage of HTML would be network access.  If Mr. Britt's
application does not require network access, than a more flexible
application for tagging and declaring displayable chunks might be more
appropriate such as IADS.  The advantage of IADS would be that using a
framing approach, the size of the chunk and the order of the displays are
very easy to specify.  Of course, it is an MS-Windows application.

If he requires a more sophisticated form, a commercial version with more
features and some HyTime support is available.

Len
</message>
<message id="<2rdrh6$tcl@sernews.raleigh.ibm.com>" date="2978279398">
Newsgroups: comp.text.sgml
Date: 18 May 1994 19:49:58 UT
From: Kevin Kizer \<kevik@vnet.ibm.com>
Organization: ISSC Southeast Region
Message-ID: <2rdrh6$tcl@sernews.raleigh.ibm.com>
Keywords: HTML AAP DTD
Subject: Need HTML and AAP DTDs

I am currently working on a SGML project and need a copy of the HTML and
AAP DTDs. Where can I find these DTDs? Is there a FTP address for this kind
of information? If not, could someone email the DTDs to me?

thanks,
Kevin Kizer
</message>
<message id="<Cq0MJ5.3JJ@ceco.ceco.com>" date="2978282128">
Newsgroups: comp.text.sgml
Date: 18 May 1994 20:35:28 UT
From: "Garry J. Garrett" \<garry@ceco.ceco.com>
Organization: Commonwealth Edison Company
Message-ID: \<Cq0MJ5.3JJ@ceco.ceco.com>
Keywords: Script SGML GML HTML conversion
Subject: Script -> HTML?

I have some documents in "Script", an IBM Mainframe product that uses a
mark up language called GML "Generalized Markup Language".  I think it is
similar, if not related to, SGML which seems to be talked about a lot in
this newsgroup.  I seem to hear a lot of people talking about converting
SGML to HTML (HyperText Markup Language).  Does anyone know of any
utilities to convert Script GML documents into HTML documents?  If you know
of anything to convert SGML to HTML, I'll try that and see if it works.
Thanks for any assistance that you can give.  I would prefer e-mail as I am
not sure that I will be able to keep up with this group in the coming
weeks.

--
##########################################################################
# Garry Garrett, garry@ceco.ceco.com
# (Me?  Speak for CECo?  I get into enough trouble speaking for myself!)
#
# Quotes I have known & loved:  :-)
#	Ask a Silly person, get a silly answer.
#	I used to be sane but I got better.
#	We are the Barney.  Resistance is futile.  You will be assimilated.
#	Nuke the Smurfs.  Nuke'em till they glow, shoot'em in the dark.
#	If Pro is the opposite of Con, and Progress is going forward...
#	Nature does not have laws; it has habits, and habits can be broken.
</message>
<message id="<9405182135.AA40862@source.asset.com>" date="2978285734">
Newsgroups: comp.text.sgml
Date: 18 May 1994 21:35:34 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405182135.AA40862@source.asset.com>
Subject: Xenoforms in SGML/HyTime

Y'all (translate: type=DeepSouthernAmerican is-a="Whomever is interested"):

I'm soliciting commentary on the use of xenoforms (stranger or foreign
objects) in SGML/HyTime applications. The problem is:

1.  Hytime ILINKs or CLINKS are not processing forms.  They point to
    objects.
2.  In order to create an application that has third-level capabilities
    (e.g., can switch or link on a condition) the behavior of the
    conditions should be included with the SGML if one is to transport the
    data across systems.
3.  SGML is verbose when one tries to specify processing semantics and the
    LINK processing is rarely/never supported by vendors.  Furthermore, one
    has to question the advisability of defining yet another programming
    language in SGML.

I'm aware of efforts (eg, SMSL) to define standard multimedia scripting
languages, but I do not know if i will become available within the time
frame of existing projects.

I am aware that xenoforms reduce the portability of the files, but that is
true of all NOTATIONs used in SGML.

I'm looking for arguments on any side of this issue.  As others, notably
kevin Allan of InfoAccess have noted, the issue of how we specify behavior
in a document is becoming increasingly important.  We all know how to do it
with VisualBasic, but is that how we want to do it with SGML.  We can
certainly define a scripting language complete with loops and switches,
etc, but is that the best idea?  Would xenoforms that notate another
language (e.g., C, Scheme, what have you) be the most practical approach?

Clueless in Alabama.

Len
</message>
<message id="<2re4s0$pd1@daphne.ecmwf.co.uk>" date="2978288960">
Newsgroups: comp.text.sgml
Date: 18 May 1994 22:29:20 UT
From: Philippe Parmentier \<cdp@ecmwf.co.uk>
Organization: European Centre for Medium Range Weather Forecasts
Message-ID: <2re4s0$pd1@daphne.ecmwf.co.uk>
Subject: y

Title says it all, I am looking for a WordPerfect to HTML or to plain text
converter.

Any pointers (or tips to write one )?
--
 Philippe Parmentier	 E-mail : cdp@ecmwf.co.uk
 Post-mail: ECMWF, Shinfield Park, Reading, Berkshire RG2 9AX, U.K.
 Telephone: U.K. (0734) 499803		International +44 734 499803
 Telex: 847908 ECMWF G				Fax: .. 869450
</message>
<message id="<19940519.2304@naggum.no>" date="2978304016">
Newsgroups: comp.text.sgml
Date: 19 May 1994 02:40:16 UT
From: Erik Naggum \<erik@naggum.no>
Organization: Naggum Software; +47 2295 0313
Message-ID: <19940519.2304@naggum.no>
References: <24916@dog.cme.nist.gov> <19940515.2234@naggum.no> <2raob4$2ri@dancer.cc.bellcore.com>
Subject: Re: nesting entities

[Erik Naggum]

|   the attitude you present is the _old_ attitude -- where software
|   vendors and programmers owned the users' data, and thought they had
|   every right to dictate the terms of its use.  we want to get away from
|   that, and instead have the vendors and programmers understand that they
|   are nothing without the data we ask them to process.  it is _our_ data,
|   not theirs.

[Haim Kilov]

|   Exactly!  E.W.Dijkstra in his 1976 book "A discipline of programming"
|   (and perhaps even earlier than that) stated that at the beginning of
|   programming the goal of our programs was to instruct our machines, but
|   "now" we understood that the goal of our machines is to execute our
|   programs.

not sure I got the connection here.  did you mean that we have progressed
from the state where the goal of our information was to please programs, to
the state where the goal of our programs is to understand our information?
if so, I agree we have made the step regarding machines and programs, but
we are still a long way from making a similar step regarding programs and
information.

in the case of the programs, the "users" were the programmers.  they "won"
the battle with the hardware, and the engineer--programmer symbiosis works
well because the problem domain is well known.  (I'm glossing over areas
the size of the Pacific here for the purpose of comparison.)  when it comes
to the information, the "users" are non-programmers, and their problem is
they have no clue what the problem domain is.  most of them "just want it
to work", as if wanting will get you there.  the central issue at work is
that programmers are the kind of people who work hard to know how to tell
the stupidest entity in the known universe how to do something complex and
are willing to hold its hand every single minute step of the way, while
users are the kind of people who work not at all to know how to tell the
smartest people in the known universe how to do something vague and mushy
and give up when asked the simplest questions.

now, if you have been reading this group for a while, you will know that I
care very much about information, and information means human users at both
ends, but I also see the single biggest threat to the information being
those users who don't realize that computers are definite, specific things,
not the infinitely flexible stuff that their dreams are made of, and that
as any other tool in human history, can be applied wisely or not so wisely.

I am deeply concerned that what we, the programmers, have relayed to the
users through our great discoveries of what we can do, that "anything" is
possible.  "anything" to a programmer is not the "anything" a user will
think of.  users will ask for the impossible up front, while programmers
grasp the nature of the problems and won't even consider the impossible.

the conflict that lies in this area is probably only getting worse as we
get vastly more users, and although proportionally less, vastly many more
programmers, who are likely to be somewhere inbetween "user" and
"programmer" in this black-and-white picture.  the first casualty in this
conflict is the information.

if Dijkstra said this before 1976, upwards of 20 years have passed while
end users have remained largely unaware that there has been a change in
focus that they, too, should have grasped, but alas! have not.  instead,
they go for the sexiest user interface.

since users can't take care of themselves, somebody else has to.  standards
may serve this purpose, but only if _some_ users are willing to learn them,
and to ask intelligent questions of the programmers and the witch doctors
between them and you (that's the marketing folks).  history has shown that
it is completely beside the point whether a solution is technically elegant
or the quickest and dirtiest hack around, as long as users are taught that
they want the quick and dirty hack, and are taught, at every occasion, that
internals are too complicated for their puny little brains to grasp as long
as they have cash and accept the license agreement that absolves the vendor
of all responsibility for quality, maintenance and fitness for use.

who are the people who _could_ take care of the users and the information?
by a simple process of elimination, we're left with the programmers.  will
the programmers who conquered the hardware relinquish control to the users?
not likely.  nor should we attempt to make them listen to vague users.
instead, they have to learn, from their earliest experience with computers
that the _information_ is the sine qua non of computing.

paradoxically, the best way to accomplish this is through intelligent user
interfaces that do _not_ gloss over the internals with the silly excuse
that it is too complicated for users to understand.  the challenge is to
make users _aware_ of the internals so that they can ignore them in that
sense of certainty and confidence that only comes from knowledge.

as a parallel, consider the demands of the Food and Drug Administration on
the documentation on foodstuff.  users are the people who go "yuck!  this
is full of _ingredients_".  there are regulations and requirements on the
sale of food that make any computer industry standard look like child's
play, and lethal toxins like tobacco require a warning from the Surgeon
General.  you would have thought people would have got the idea by now?
but, no.  they still use MS-DOS and they still accept crashing systems that
destroy their information, partly because they fed it virus-infested
diskettes or programs.

notice how conveniently this fits in with SGML.  declare each part of the
document, declare the document type, declare the notations used, declare
the processing capability required to process it, declare character sets
and toxic codes.  all computer files should come with a list of ingredients
and recommended daily allowance like this.  maybe then we can get users to
think in terms of _information_health_, instead of today's junkware.

thanks for listening to my rambling.

best regards,
\</Erik>
--
Erik Naggum \<erik@naggum.no> \<SGML@ifi.uio.no>  |  memento, terrigena.
ISO 8879 SGML, ISO 10744 HyTime, ISO 10646 UCS  |  memento, vita brevis.
</message>
<message id="<CONNOLLY.94May18234032@austin2.hal.com>" date="2978311232">
Newsgroups: comp.text.sgml
Date: 19 May 1994 04:40:32 UT
From: Dan Connolly \<connolly@hal.com>
Organization: HaL Computer Systems, Inc.
Message-ID: \<CONNOLLY.94May18234032@austin2.hal.com>
References: <16FB8107D6S85.UPP201@ibm.rhrz.uni-bonn.de> <1994May17.083731.16013@leeds.ac.uk>
Subject: Re: SGML, HTML and LaTeX

[Michael Burshik]

|   There seem to be some tools that convert SGML or HTML documents into
|   LaTeX for printing.  What is the most recommendable package?  Are there
|   any tools for converting LaTeX files into SGML or HTML?

As to SGML to LaTex, check out the QWERT dtd and the sgml2latex-format
system, available at the ifi.uio.no ftp archive (I beleive... I'm having a
low-bandwidth evening, or I'd verify the reference myself)

[Nikos F Drakos]

|   Try http://info.cern.ch/hypertext/WWW/Tools/
|   for a list of conversion tools to and from HTML.

This is a real hodge-podge of stuff.  Caveat emptor.

|   More information on LaTeX2HTML is available at
|   http://cbl.leeds.ac.uk/nikos/tex2html/doc/latex2html/latex2html.html

I highly recommend this tool -- it seems like a good tools that not only
generates HTML, but provides much needed structure (TOC, index, that sort
of thing) for aggregate HTML documents.

I would like to see latex2html generate conforming SGML documents, but
until there is a stable DTD for HTML, I cannot blame the developer for the
fact that the HTML it produces isn't valid -- especially considering the
fact that once the tool is revised to produce conforming documents, you can
just regenerate all your documents.

By the way... see

	http://www.hal.com/%7Econnolly/html-test/html.dtd
		and html.decl
		and README.html

for info on an HTML DTD, which will be released as part of a new HTML spec
Real Soon Now.

And for the pedants, yes, the spec starts out with:

	HTML
	An Application Conforming to ISO 8879 -- Standard Generalized
	Markup Language

No bones about it, this time around!

--
Daniel W. Connolly        "We believe in the interconnectedness of all things"
Software Engineer, Hal Software Systems, OLIAS project   (512) 834-9962 x5010
\<connolly@hal.com>                   http://www.hal.com/%7Econnolly/index.html
</message>
<message id="<Cq1u15.I2@unx.sas.com>" date="2978338505">
Newsgroups: comp.text.sgml
Date: 19 May 1994 12:15:05 UT
From: Mark E. Britt \<sasmsb@unx.sas.com>
Organization: SAS Institute Inc.
Message-ID: \<Cq1u15.I2@unx.sas.com>
References: \<Cpy5s7.1ww@unx.sas.com> \<DMEGGINS.94May18122619@aix1.uottawa.ca> <9405181849.AA58090@source.asset.com>
Subject: Re: Defining a Display in HTML

[Len Bullard]

|   It is also possible that HTML is the wrong application for creating
|   tutorials.  Am I right in assuming that the use of HTML assumes the use of
|   an HTML DTD which fixes the data tagging of the instance?  If so, then the
:   
|   If he requires a more sophisticated form, a commercial version with more
|   features and some HyTime support is available.

Since my original posting, I've come to believe that I am trying to use
HTML for something it wasn't designed to handle -- a multimedia
application.  After further investigation, I think I need to use the HyTime
standard. After all, what I'm creating is a multimedia application;
defining the display is only one of the problems not addressed by HTML or
SGML.

Like you, Mr. Bullard, I'm also interested in hearing from people who have
been using HyTime to develop multimedia applications.

Thanks for the responses.
--
Mark E. Britt | sasmsb@unx.sas.com | The lust for comfort murders the
CBT Developer | (919)677-8000x6497 | passion of the soul, and then
SAS Institute | Opinions are mine. | walks grinning in the funeral.
                                     - Kahlil Gibran, "The Prophet"
</message>
<message id="<2rfll4INNedp@oasys.dt.navy.mil>" date="2978338916">
Newsgroups: comp.text.sgml
Date: 19 May 1994 12:21:56 UT
From: Betty Harvey \<harvey@oasys.dt.navy.mil>
Organization: Carderock Division, NSWC, Bethesda, MD
Message-ID: <2rfll4INNedp@oasys.dt.navy.mil>
References: <1994May16.145935.9648@xgml.com> <9405172030.AA37387@source.asset.com>
Subject: Re: Tool Limitations

[Len Bullard]

|   Erik, its good to be powerful and knowledgeable and Lord Knows I also found
|   the GUIs to be a fine way to slow down a power user, somewhat similar to
|   QWERTY/DVORAK keyboard controversy.  However, lovers of the harpsichord
							       ^^^^^^^^^^^
I still stand by the harpsichord it makes beautiful music.

Working in a computer user support organization for a long time I learned a
very valueable lesson that I will pass on |-).  A tool that I use for
specific application might not be the best tool you person to use.  There
plusses and minuses to each and every tool for every application, i.e.,
Word Processing, Spreadsheets, Databases.  SGML tools (in my opinion) are
currently in the infant stage and moving quickly into childhood.

From my experience, thus far, I haven't found an SGML tool that I could
endorse.  However, there are good and bad features about each and every one
of them.  On the other hand, I wouldn't suggest that someone who is just
learning and working with SGML work in an ASCII environment to create SGML
documents.  When SGML editors work they keep the author 'honest'.

I am looking forward to the future when SGML tools become more mature and
less costly.  Tools like Microstars Near and Far, in my humble opinion, are
helping to create a less hostile environment for new and future SGMLists.
A picture can tell a thousand words and it is great to see the complete
hierarchy of a DTD content.  It is important for us (in the SGML world) to
be advocates of the richness, useability and effectiveness of SGML.  Unless
the proper tools are available this will not be an easy task.

However, applications like the WWW are and will help tremendously.  I have
been giving many demonstrations recently on the World Wide Web.  One of the
items about this great technology that I stress strongly is that it is
based on opens systems values -- SGML being the strongest.  HTML might not
be best written SGML, but it wasn't developed by SGMList.  But look at what
has happened.  As the WWW is maturing and SGMList are getting involved,
HTML is maturing.  When the non-SGMList see the WWW in action they are
awestruck.  When they hear the SGML is the way the data is transferred,
they don't care if it is good or bad SGML -- all they know is that it is
working in a World-wide environment and that anyone can create documents
and information in an open environment.

I'll get off the soapbox now and I hope I haven't bored anyone.  Vendors
keep up the good work.  Workers in the trenches are anxiously waiting.

--
Betty Harvey  \<harvey@oasys.dt.navy.mil>     | David Taylor Model Basin
Advanced Information Systems Branch          | Carderock Division
Code 183                                     | Naval Surface Warfare
Bethesda, Md.  20084-5000                    |   Center
(301)227-3348   FAX (301)227-3343            | DTMB,CD,NSWC
</message>
<message id="<9405191421.AA22385@source.asset.com>" date="2978346091">
Newsgroups: comp.text.sgml
Date: 19 May 1994 14:21:31 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405191421.AA22385@source.asset.com>
References: \<Cpy5s7.1ww@unx.sas.com> \<DMEGGINS.94May18122619@aix1.uottawa.ca> <9405181849.AA58090@source.asset.com> \<Cq1u15.I2@unx.sas.com>
Subject: Re: Defining a Display in HTML

I'm not exactly sure what the limits or goals of HTML are.  I've read a
design document for it, but truthfully, haven't tracked it as I have not
yet been privileged to visit a site where it was active.

Every new developer of hypermedia typically fixes the data types early in
the development of the system.  This is what some of us call "hardwiring
the data type".  This is not necessarily a "bad" practice.  It is
convenient for the programmer.  On the other hand, it can be inconvenient
for the information designer (read: DTD hackmeister |-) ) who wants to
model the information based on other criteria than how a paint command loop
wants to see it.  Also, when embedding behavior (third level directed
navigation) into the information, it is often advantageous if the
representation (encoding) were not bound to the presentation (a la RTF).
When the DTD is fixed, one gets a WYSIWYG or RTF approach.

It is convenient if the network protocol is entirely separate from the data
representation.  Perhaps this is wishing for the moon.  However, local
conditions (data at site under use) and global conditions (data on net
under transport) must be separable.  This is the flexibility that is
promised by SGML, but often abdicated by the programmer who works within
the constraints of hardware and operating systems over which the programmer
has little control.

There are a lot of issues that basically resolve to the discussion of
addressing and entropy (for those of you who think that is too abstract,
revisit Claude Shannon's work).  These issues have to be resolved by the
designers when they scope the mission of a system.  There ain't no magic
... just design trade-offs and the loop between system and environment.
SGML bridges that gap by providing a lot of adaptability in the
representation of information.  When I teach SGML, I prefer to have a
student code a spice rack instead of a document type.  By this exercise,
the student establishes a view of SGML that is not too closely coupled to
headers, footers, paragraphs and line feeds.  instead, they are taught to
understand the application of abstract data types against which processes
can be posted.  SGML is a markup language.  Finis.  It is the model in the
mind that must be expressed and this can only be accomplished when the
designer can achieve that level of abstraction which can then be expressed
in SGML.  Finding a tool that allows this level of abstraction and has a
rich enough function library that can be easily assembled into 'aggregate'
forms is the real challenge of the SGML users and vendors.

For example, IADS allows you to create a multi-tiered frame-based
environment in which a limited amount of if/then processing is possible.
Direct multiple-choice selection for navigation and embedding of buttons is
very easy to do.  Graphics can be embedded in the text and these too can be
"hot".  Links to other documents and executables are very easy to build.
There are no requirements for tagging that seriously constrain the designer
other than using the frame and file tags, and that is not too large a
burden.  What doesn't it do well?  It doesn't allow a writer to embed a
dialog that elicits information from the user of the application, or give
the user expression operators.  Can a user get around this?  Yes, by
building Visual Basic applications that direct the access to the frame
file, its very simple to create third level capabilities that are required
for more advanced tutorial work.  Is that a limitation?  It chiefly depends
on cost and which parts of the information must be portable.  And so it
goes...

Even if you use HyTime applications, you will have only solved one part of
the problem.  HyTime is still not the answer for embedding behavior in the
model.  At this point, it seems we still have to resort to xenoforms (e.g.,
embedding Visual Basic or C as foreign notations in the SGML instance) if
we want to represent conditional processing (see ISO 10744 10.5 Objects).
So, although I am a supporter of HyTime, I would still have to tell a
customer that they may not be far beyond the capabilities of IADS should
they buy a HyTime-compliant systems.  If standardized linking is not their
primary concern, then an application like Visual Basic that provides a
script-driven display and event handling (as in event loop) will still be
needed.

I am sure that the next year will see applications that are:

1.  Flexible - aggregate-oriented will appear.  Then we will have to be
    more precise when asking for features, that is, really sure of what we
    want to do with the system.  In some circles, that is called, defining
    the mission.

2.  Beyond the Book Metaphor -- don't force us to treat behavioral
    information as if we were a finger tracing the index or table of
    contents to locate a topic.

3.  CHEAPER!  If vendors continue to price SGML applications like mainframe
    software, while MS-Windows products that deliver 85% of the capability
    for one tenth of the price are proliferating, SGML will go the way of
    COBOL.  Still in use, but regarded as clumsy and unloved by most of the
    world.  International standardization cannot ensure use.  We buy what
    we can afford, even if that means we have to buy something else a few
    years down the road.  Witness the number of Volkswagen Beetles still on
    the road.

Meanwhile, I must go back and add another slot to the spice rack.

Cheers,
Len Bullard
</message>
<message id="<9405191445.AA30701@source.asset.com>" date="2978347542">
Newsgroups: comp.text.sgml
Date: 19 May 1994 14:45:42 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405191445.AA30701@source.asset.com>
References: <1994May16.145935.9648@xgml.com> <9405172030.AA37387@source.asset.com> <2rfll4INNedp@oasys.dt.navy.mil>
Subject: Re: Tool Limitations

Good to finally hear from you Ms. Harvey! |-) Harpsichords do indeed make
beautiful music and in surprisingly diverse styles unless you need more
controlled sostenuto.  On the other hand, when out into an amplified
environment, its simply easier to use the synthesizer.  The point: you are
quite right.  All of the tools have limitations, strengths and habits, and
all of these combine to produce cultures and cultural biases.  HTML appears
to be a marvelous application and I salute those who have labored hard to
build it.  But as Mr. Britt has discovered, it is yet another tools
appropriate to that which it was designed to do.  As long as its culture
understands this, peace.  Otherwise, we end up having those rancorous
debates that have almost destroyed the 28001 effort in which one tag bag
was supposed to be a final word.

In short: A picture is only worth a thousand words if you already have a
thousand words to trade.  One must recognize the picture.  Show a picture
of a tree to a fish.  Good and bad in SGML is like good and bad in most
things: a matter of position in time and space, and a set of private
beliefs acquired from culture and introspection.  And a self-determined
relationship to dimensions of existence both seen and unseen, felt and
unrealized.

"Science without philosophy is like sex without love: stimulating, but
unsatisfying and occasionally degrading."

As yet another worker in a trench, my regards and best wishes.

Len Bullard
</message>
<message id="<1994May19.150823.17879@dxcern.cern.ch>" date="2978348903">
Newsgroups: comp.text.sgml
Date: 19 May 1994 15:08:23 UT
From: Eric van Herwijnen \<evh@dxcern.cern.ch>
Organization: CERN European Laboratory for Particle Physics
Message-ID: <1994May19.150823.17879@dxcern.cern.ch>
Subject: SGML year in review [posted on behalf of Pam and Yuri]

The SGML World Report/Year in Review

Being the opening presentation of SGML Europe '94, Montreux, Switzerland,
May 16, 1994.

Pamela Gennusa, Database Publishing Systems Limited
Yuri Rubinsky, SoftQuad Inc.

(c) Copyright 1994 Pam Gennusa and Yuri Rubinsky
This article may be copied in whole or in part provided that this copyright
notice not be removed.


DISCLAIMER: As always, this is a highly personal whirlwind tour and
reflects a somewhat idiosyncratic collection methodology: The Year in
Review comprises more or less whatever someone mails us or tells us or
hands us on a slip of paper on a tourist train to Gstaad the evening before
the GCA SGML Europe '94 Conference, during the last free time anyone will
have for four long and productive days.


1. STANDARDS ACTIVITIES

DSSSL, ISO/DIS 10179 -- Document Style Semantics & Specification Language
     Sharon Adler reports that Draft International Standard ISO/DIS 10179,
     Document Style Semantics & Specification Language, otherwise known as
     DSSSL, will be going out for its second DIS ballot later this Spring.

ISO 12083 -- reincarnation of AAP Electronic Manuscript Project
     Last December marked the formal announcement of ISO 12083, the
     much-enhanced and internationalized version of the ANSI/NISO standard
     Z39.59-1988, the American National Standard for Electronic Manuscript
     Preparation and Markup (also known as the AAP DTD).  The ISO standard
     differs from the ANSI standard in that all ambiguities, redundancies
     and formatting specific aspects of the original have been removed;
     element and attribute names and values have generally been lengthened
     for increased legibility; and simple constructs for support of HyTime
     and publication for the print-disabled have been incorporated into the
     DTDs.

STEP, Standard for Exchange of Product Data, ISO 10303 - SGML Activity
within STEP
     In a world quite far from the one in which most of us work most of the
     time, engineers use computer-aided design software to aid in the
     manufacturing process.  Generally, documentation is a separate
     activity.  Now, however, as the interchange of product data becomes
     more and more important so too does the requirement for the
     interoperability of the engineering data and the technical
     publications.  Standardization of this relationship falls under ISO
     10303 -- STEP, working group 14.  The activity is at the very early
     stages and has a theoretical approach to both the encoding and the
     transformations that must take place on product data during the
     production publishing process.

2. MAJOR PUBLIC APPLICATIONS

AITF -- Art Industry Task Force
     A collection of museums and major art galleries are collaborating to
     produce an SGML application from which catalogues and collection
     databases can be derived.  The DTD is in late draft stages and was
     developed by David Durand of Boston University.

CALS -- Continuous Acquisition and Life-cycle Support
     CALS Test Network now has 45 CALS conforming DTDs available.
     Highlight of the year for CALS was changing the acronym once again.
     (Most recent previous version: Computer-Aided Acquisition and
     Life-cycle Support.)

CApH
     At the CApH meeting last week, the group developed a facility for the
     rigorour expression of semantic assignments for elements, attributes,
     architectural forms and anchroles.  At the same time, the CApH group
     is recasting its work as a set of semantic declarations.  CApH, by the
     way, stands for ``Conventions for the Application of HyTime''.

Davenport Group
     Version 2.1 of the DocBook DTD -- a DTD for computer documentation --
     is being implemented by several members of the Davenport Group
     (including Novell, Digital, and Uniscope).  Development is focused on
     fixing bugs turned up by implementation.  Math DTD fragments are being
     tested for possible adoption in DocBook while discussions continue on
     deleting some inline tags, on linking and generated text, on
     annotations, and on the organization of meta information for objects.
     By now all the documentation for UNIX System V Release 4 is marked up
     in DocBook and Uniscope reports that the first Japanese-language
     application of the DocBook DTD for software manuals has been
     completed: They have delivered 4,000 pages of Novell documentation
     on-line and in print.

ICADD -- International Committee for Accessible Document Design
     A meeting was held last week at Recording for the Blind in Princeton,
     New Jersey where some 30 interested specialists from around the world
     began the work of ensuring the interoperability of SGML math encodings
     with Braille math and new projects for spoken math.  One of the
     suggestions is to treat the ISO 12083 math -- which is optimized for
     display -- essentially as a set of Architectural forms for more
     semantically-oriented elements.

     The European Union-funded project, CAPS, is completing its development
     of a prototype electronic library for the blind using SGML documents
     and the ICADD mechanism to make these documents accessible for the
     blind.

IPTC (International Press Telecommunications Council) and NAA (Newspaper
Association of America)
     Development of a Universal Text Format DTD for the news industry is
     proceeding under the auspices of the IPTC and NAA.  A joint committee
     of the two organizations continues to revise the DTD and prepare
     supporting documents.  The UTF would be used primarily by wire
     services, newspapers, and archival database providers.  The working
     group includes representatives from Reuters, Agence France Press,
     Deutsche Press Agency, the Associated Press, UPI, the New York Times,
     Mead Data Central, and Dow Jones.  The committee hopes to distribute
     draft introductory material in June.

Pharmaceutical Industry
     There will soon be an official announcement of a `Memorandum of
     Understanding' between the US Food and Drug Administration, the
     Canadian Health Protection Branch, and the Medical Products Agency in
     Sweden.  These participants have agreed to collaborate in the
     development of DTDs, and in defining other appropriate interchange
     standards for electronic regulatory submissions to the three
     countries.  They also hope to expand their efforts to other
     international bodies.  This has already been announced at several
     public meetings in the US.

Pinnacles Group
     The Pinnacles Group released Draft 1.0 of the Pinnacles Component
     Information Standard (PCIS) for limited review.  This standard for the
     interchange of electronic component information is intended for use
     throughout the semiconductor industry.  A review session was held
     27-29th April and the Group expects to release 1.1 in early August and
     begin bench mark testing this Summer.

TEI -- Text Encoding Initiative
     The long-awaited and much-looked-forward-to publication of P3, the
     1,300-page TEI Guidelines for Electronic Text Encoding and Interchange
     is complete.  Years and years of work has resulted in a formal
     document type definition for the textual features most important to
     the use of electronic text in academic research.  Publication is in
     paper and on disk.

World Wide Web and HTML
     The world's largest and most publicized SGML activity has come into
     being in that ethereal zone called the Internet.  The World Wide Web
     -- a collection of protocols developed at CERN -- has given rise to a
     powerful set of capabilities running over the Internet.  These may now
     be accessed through a variety of software applications which use a
     very simple SGML tagset known as HTML.  The most famous of the
     browsing tools, called Mosaic, is freely available from an FTP site at
     the US National Center for Supercomputing Applications in Champaign,
     Illinois.  The software is being downloaded some 3000 times a day with
     over a million copies now in use.  A more robust and useful DTD,
     called HTMLplus, is now being created and Mosaic is being upgraded to
     support it.

3. USER GROUP ACTIVITY

[This section lists all chapters but only reports on the newest
ones.]

European Chapters:

Belgian-Luxemburg
     SGML BeLux is a new chapter, founded this past year.  They have had
     their first conference with 37 attendees; produce a local newsletter,
     have a second conference planned for October, and have 72 individual
     members and 73 corporate members.

British

Danish

Dutch

Finnish
     The SGML User Group in Finland was founded this past March.  The User
     Group already boasts about 40 members.

French

German

Italian

Norwegian

Swedish
     The Swedish SGML Users' Group had its first regular annual meeting in
     March'94, with approximately 150 participants.  This Fall the Swedish
     Users' Group will arrange two workshops, one on HyTime and the other
     on SGML in publishing.

Swiss
     The Swiss group started the activity of their Hypertext Working Group
     with the ``Hypertext and the future of text'' conference held on the
     26th of April.  The SGML Users' Group Switzerland (SUGS) newsletter is
     now edited in SGML and published with DynaText.  This new presentation
     allows for the delivery of complete information pieces rather than
     abstracts.  For the future, each new issue of the newsletter will
     consolidate the previous one, providing access to an increasing set of
     information that should become a reference document.

North American Chapters

Arizona

Canadian

Mid-Atlantic

Midwest

New York

Northern California
     The Northern California chapter is in the final stages of drafting its
     constitution and bylaws and is preparing to mail out its first call
     for members.

Rocky Mountain

Southeast

Southern California

Asian Pacific

Australian

Special Interest Group

European Workgroup on SGML

SIGhyper
     Erik Naggum has succeeded Steve Newcomb as Chairman of the SGML
     SIGhyper group, the long name of which is ``The SGML Users' Group's
     Special Interest Group on Hypertext and Multimedia''.

Joan Smith
     Last year we announced the retirement of Joan Smith, the founder of
     the SGML Users' Group.  This year we add a post script to that
     announcement to say that the Group has presented Joan with a silver
     tray to commemorate her contribution to the Standard and the Users'
     Group.  The inscription reads ``To Joan Smith With Thanks For Your
     Untiring Efforts, SGML USERS' GROUP 1994''

4. CORPORATE AND GOVERNMENT INITIATIVES

Aerospatiale
     Aerospatiale, the provider of Airbus and ATR documentation, has always
     been deeply involved in the international standardization process
     (including participation in the working groups and leadership of
     several of them) and is now able to produce data in conformance with
     the Air Transport Association (ATA) standards, but also to provide
     airlines with tools for exploiting these data.  Aerospatiale has
     developed a complete ground-based system ADOC, a documentation
     engineering workbench that integrates computerized data compliant with
     CALS and ATA standards, processes the information and outputs it,
     either in hard copy or on electronic medium.  Aerospatiale has also
     designed the Ground Consultation System (GCS), the complementary tool
     to immediately access the documentation.  Air Inter (Air France Group)
     has been a launching customer for ADOC for over a year and has used it
     to release its A330 electronic technical documentation.  The first
     industrial release of ADOC and GCS is in May 1994.

Airbus
     Since 1993, AIRBUS has been delivering digital aircraft technical
     publications according to ATA/AIA digital data standards to interested
     customers.  As a consequence, airlines such as Air Inter, Lufthansa,
     All Nipon Airways, and United Airlines have been provided with SGML
     documents for AIRBUS A340, A330, or A320 aircraft and includes
     Aircraft Maintenance Manuals, Illustrated Parts Catalogues
     (representing 20,000 to 30,000 pages each), and operating manuals
     (Flight Crew Operating Manual and Master Minimum Equipment List).
     This revolution in the area of civil aircraft support enables airlines
     to manage, process, distribute, and consult electronic standard
     technical publications.

Australian Department of Defense
     The Australian DoD has hired consultants to examine the application of
     SGML for the implementation of the Department's CALS strategy.  CALS
     will be an integral part of its future capital equipment acquisition
     program and fundamental to effective through-life support for those
     weapons.  The department's contracts for new systems are already
     demanding that companies deliver technical documentation to a CALS
     standard.

British National Corpus
     British National Corpus, a UK government-funded academic/industrial
     consortium, is developing a 100 million word corpus of modern English
     for use in lexicography and linguistic research.  Completed in April
     1994, this corpus is marked up in SGML, including part of speech codes
     and will be freely available for research purposes, together with a
     high performance SGML-aware browser/indexer developed for the project.

Brooker's
     Brooker's has adopted the use of SGML for legal publishing about ten
     months ago, converting the Statutes and Regulations of New Zealand
     into SGML.

Bureau Veritas
     The Marine Branch of Bureau Veritas has selected SGML as the standard
     representation format for their Rules and Regulations for the
     Classification of Ships (a 1000+ page document containing more than
     5000 math formulas, many tables and graphics) and contracted AIS for
     automatic up-conversion of this large document to SGML, starting from
     Autologic MOPAS typesetting code.  This SGML database will now be used
     as the single source for printed material and for preparation of a
     CD-ROM, to be completed by year end.

Butterworth Legal Publishers
     Butterworth Legal Publishers are moving some of their documentation to
     SGML.  Their initial project was a set of State administrative
     regulation rules created by the various State agencies such as Fish
     and Game, Motor Vehicles, Environmental Protection, etc.  The set
     consists of 22 titles (10,000 pages).

Canadian Strategic Software Consortium (CSSC)
     Open Text and Fulcrum are leading a consortium of Canadian software
     vendors and VARS to integrate full text capabilities with relational
     databases.  It is hoped that the project will be supported with
     matching funds from the Canadian government.

Editions Legislative
     Editions Legislative is the third major French legal text publisher to
     adopt SGML.  SGML will be used for several new projects, including the
     conversion of their existing French Social law database for the
     Dictionnaire Permanent Conventions Collectives to SGML for electronic
     and paper production.

Electricité de France
     Grif reports that Electricité de France, one of the world's largest
     electric utilities, has taken a new tack in its efforts to extend the
     life cycle of maintenance documentation for its nuclear power
     stations.  With s pilot project now behind them, EDF will use SGML for
     all its nuclear power stations.  EDF intends to use of SGML by asking
     suppliers to provide documents in SGML so that they can be
     incorporated directly into the IT system without having to convert
     them.

Flammarion
     Flammarion has selected SGML as the standard representation format for
     their new Dictionnaire de Geopolitique, a 2000-page encyclopedia
     incorporating many maps.  Both the printed version and a DynaText-
     based CD-ROM, including many generated indexes, were 100%
     automatically produced from this SGML database.  Geopolitics being a
     rapidly evolving field, a new edition will be produced every year.

Ford Motor Company & General Motors
     Ford Motor Company and General Motors (GM) demonstrated their new
     interactive GUIDE-based service bay information and diagnostic systems
     to a crowd of over 25,000 automotive industry attendees at the annual
     North American Dealer Association (NADA) show.  Ford and GM are of
     particular interest because they are following the J2008 DTD.

Institute of Electrical Engineers
     The Institute of Electrical Engineers has appointed Pindar Infotek to
     produce its first electronic journal making its learned works
     available via an on-line service.  They will transmit the database to
     the host company OCLC in Ohio, USA, who will make the service
     available to subscribers.  The database will be restructured using
     SGML, but for the first time the database, once created, will also be
     used directly to make up pages for print.

McGraw Hill
     McGraw Hill's Encyclopedia of Science and Technology has been released
     on CD-ROM, based on an SGML application.

NIPDE
     The US National Initiative for Product Data Exchange is leading a
     project to build prototypes of production environments within the
     manufacturing sector that will explore and demonstrate the
     interoperability of SGML and STEP.

PharmaSoft
     PharmaSoft is launching a Drug Dossier Manager, DDM, based upon SGML
     for the management of documents in pharmaceutical companies.  The
     first version is scheduled for the end of 1994.  DDM is based upon a
     knowledge base from which New Drug Applications (NDAs) or Product
     License Applications (PLAs) or any other regulatory document can be
     created.  DDM can print an NDA or PLA or provide a CANDA or CAPLA
     (that is, computer-aided versions of each).  PharmaSoft is also
     launching an add-on software to the SWEDIS Package.  The new package,
     REGULATOR, is SGML based and designed for Drug Regulatory Authorities.
     REGULATOR manages all knowledge within an Authority.

Pindar Infotek Ltd
     Pindar Infotek reports that the value and variety of their SGML back
     and front file conversions has increased 4 fold in the past 12 months.
     Projects include managing the capture of 2 30 volume dictionaries: The
     Macmillan Dictionary of Art which contains 25 million words and the
     Grove Dictionary of Music which as 20 million.

Shell U.K. Exploration and Production Ltd
     Shell U.K. Exploration and Production, operators in the U.K. sector of
     the North Sea on behalf of Shell and Esso, have appointed Pindar
     Infotek to convert existing operating standards and procedures
     documents into SGML.  These converted documents are then to be loaded
     into their Electronic Document Management System (EDMS) database so
     that they can be distributed as hypertext documents to 29 offshore
     installations.

Swedish Defence -- FMV Grund-DTD
     Swedish Defence is developing a content-based DTD, the first version
     is due to be finished by the end of this month, May 1994.  This DTD
     will be used to create technical information in accordance with the
     structure of the product data.  One of the Wednesday afternoon
     workshops will be devoted to this topic.

US Air Force
     The U.S. Air Force has developed several Documentation Sets, which
     consist of a DTD, FOSI, tagged instance, and an output of a sample
     technical order, to serve as test data for output systems, and provide
     an illustration of the purpose of the CALS effort.  They have released
     MIL-M-38784D for coordination in early April and it should be
     available this Fall.  Lastly, the U.S. Air Force has established an
     email group devoted to FOSI issues, called FOSI-NEWS.

US National Institute of Standards and Technology
     The US Department of Commerce National Institute of Standards and
     Technology (NIST) in Gaithersburg, Maryland is using SGML to implement
     intelligent structure-based access to documents associated with ISO
     10303, the Standard for the Exchange of Product Model Data (STEP).
     This SGML implementation is part of a larger National PDES Testbed
     (NPT) Program called the Application Protocol Development Environment
     (APDE).  Application Protocols (APs) are Parts of STEP specifying what
     industrial context (e.g., AP203: configuration controlled 3D
     modeling).  The APDE is an integrated software environment for STEP AP
     developers to support quicker development of high quality APs.  The
     APDE's core will be a central information base at NIST consisting of
     STEP-related information, models, documents and other data.  It is the
     intent of the APDE to provide SGML representation for all STEP
     component documents.  Development of NIST's SGML environment for STEP
     AP developers is now underway.  A DTD for STEP Integrated Resources,
     documents defining generic constructs (such as `geometry') that are
     incorporated into APs, has been written and a DTD for APs is currently
     under development.  Several STEP documents have already been tagged in
     SGML and a prototype SGML authoring environment for STEP has been
     implemented.

US Government Printing Office
     A federal agency forum has been started by the GCA to expand the use
     of SGML within government agencies.  The US Government Printing Office
     is going to publish the Federal Register and the Congressional Record
     on CD-ROM using SGML.

US Navy
     Unisys, Huntsville is leading an effort for the US Navy David Taylor
     Model Basin (DTMB) to create an SGML/HyTime DTD for View Packages (the
     delivery view of a collection of technical information on a weapon
     system).  This document type definition will complete the electronic
     pathway for interactive documents that are specified by MIL-M-87268
     and MIL-D-87269 (the military specifications for Interactive
     Electronic Technical Manuals).

Volvo
     Both Volvo Car Corporation and Volvo Truck have announced the
     electronic distribution of dealer information.  Authoring is done in
     SGML and information is stored in a database.  Volvo Truck is
     distributing on CD and paper.  Volvo Car Corporation documentation
     distribution is via CD in proprietary format and in SGML using J2008.

VTT
     VTT (Technical Research Center of Finland) has studied SGML since 1991
     in various research/feasibility projects and is currently implementing
     SGML in an extensive CALS pilot project.

5. NEW & FORTHCOMING PUBLICATIONS

Press Coverage
     North American coverage of SGML in the mainstream press continues to
     grow with mention in articles in Business Week, US News & World
     Report, Forbes, and even a letter to the editor in Sciences Magazine.

Prentice Hall Charles F. Goldfarb Series on Open Information Management
     Developing SGML Document Type Definitions, Eve Maler and Jeanne El
     Andaloussi, a guide to designing and implementing custom DTDs

     Practical Hypermedia: An Introduction to HyTime, W. Eliot Kimber, an
     introduction to HyTime for developers interested in using it to
     develop hypermedia applications.

     SGML on the Block, Pamela Gennusa, focuses on the acquisition and
     delivery processes surrounding electronic documentation, specifically
     using SGML.

Wolters Kluwer Academic Publishers
     Making Hypermedia Work: A User's Guide to HyTime, Steve DeRose and
     David Durand

     Practical SGML -- 2nd Edition, Eric van Herwijnen, an updated and
     expanded edition of this 1990 publication.

SGML CD-ROM Resource
     SoftQuad has recently completed a CD-ROM entitled the SGML World Tour
     which contains dozens of articles about SGML, excerpts from existing
     and forthcoming SGML books, and a hypertext version of nearly 4000
     articles from the comp.text.sgml newsgroup.  The disk runs on Windows,
     DOS, Macintosh, and Sun UNIX machines and the introductory materials
     are included in both Braille and a computer voice accessible version.
     The disc marks the first public use of HyTime: Links in both the
     Windows and UNIX versions of the contents -- which run on different
     browsers -- were generated automatically from the same files.

6. INDUSTRY NEWS

AIS S.A.
     Balise, the SGML application language first introduced by AIS during
     last year's SGML Europe conference, is now available in version 2.2.
     New features include enhanced pattern-matching and replacement
     primitives, non-SGML stream processing, user-defined datatypes and a
     Windows DLL version for OEMs.  This year, AIS announced several new
     products in the Balise family, including connectivity with native SGML
     databases and coupling with EBT's DynaText technology.

ArborText, Inc.
     The 1994 release of ADEPT Editor and Document Architect delivers a
     fully supported native Windows 3.1 implementation of ArborText's ADEPT
     Series and will be shipping in June of 1994.  ArborText has announced
     PowerPaste, an interactive conversion and translation workbench for
     handling complex data conversions into SGML.  PowerPaste runs on
     leading UNIX workstations including DEC 2100, 3100, & 5000, DEC Alpha
     AXP, HP 9000, IBM RS/6000, Silicon Graphics Iris & Indigo, and Sun
     SPARCstations.  At the fall 1993 Seybold Conference, the ADEPT series
     won an Award for Excellence.

Avalanche Development Company
     In July 1993, Interleaf Inc. acquired Avalanche Development Company.
     The acquisition has enabled the two companies to enhance their working
     relationship through the integration of their technologies and
     expertise in document management and SGML.  New product releases
     shipping in early Summer '94 include FastTAG 2.0 and SGML Hammer 2.0.

Büro für Software-Entwicklung (BSE)
     Version 1.0 of LEX2SGML was released this April by Büro für
     Software-Entwicklung (BSE) and is available for DOS and SunOS.
     LEX2SGML takes more or less formatted plain text files, recognizes the
     structural elements of law texts like sections, references and so, and
     converts them to SGML.  Further development is targeted at providing
     interactive conversion and editing under MS Windows and adding
     object-oriented database functionality.

Corel Corel, best known for its CorelDraw software, recently purchased
     Ventura Publisher and has reissued it as Corel Publisher.  As part of
     its newest release it is bundling in, at no extra cost, an SGML import
     mechanism created by Zandar, the makers of TagWrite.  This makes Corel
     the first mass-market software publisher to include SGML functionality
     as part of its base product.

Datalogics, Inc.
     Datalogics announced the port of WriterStation to Windows and Windows
     NT.  Table editing has also been announced as part of the port.

Electronic Book Technologies Inc.
     Electronic Book Technologies (EBT) introduced an Asian language
     version of its DynaText Electronic Book Publishing System.  According
     to EBT, the underlying multi-byte technology is capable of handling a
     variety of Asian languages including Chinese and Korean.

Exoterica Corporation
     Exoterica will ship OmniMark Version 2 Release 4 in July.  This
     release provides important SGML-to-SGML transformation features
     including programmer access to comments and marked sections.
     Exoterica has opened a European office to expand its presence in
     Europe based in Paris and Lotus Development Corporation's Word
     Processing Division has licensed Exoterica's SGML parser, the SGML
     Kernel.

Grif, S.A.
     Grif has released PC-Windows versions of all its SGML products and
     notably GATE, an interactive object-oriented API that facilitates
     integration of the Grif Users' Services with existing information
     systems.  April 1994 saw the release of Grif SGML ActiveViews, an SGML
     viewer that shares the same interface as Grif SGML Editor.  Grif has
     also announced a distribution deal with Canadian-based Microstar, for
     its Near & Far software tool, adding to GRIF's range of PC software
     products and services for handling SGML.

HaL Software Systems
     HaL Software Systems have announced the Olias browser, an electronic
     document delivery product which will provide seamless access to both
     SGML documents and World Wide Web documents.

InContext Systems Inc.
     Canada's Department of National Defense has awarded InContext a
     further contract valued at US$371,000.  The contract is to aid in the
     development of a pilot system providing an operational example of the
     planned DND publishing architecture through which DND will process
     CALS standard information from internal sources and external
     contractors.  InContext is currently installing the first DND
     reference CALS system, under an earlier contract valued at $303,000.

InfoAccess Incorporated
     To solve the problem of displaying and manipulating highly complex
     tables in electronic form, InfoAccess Inc. now offers the addition of
     an enhanced table viewing component to GUIDE Professional Publisher
     Version 3.5 (GPP).  This new module enhances GUIDE Professional
     Publisher's ability to automatically convert paper-based information
     into interactive documents optimized for on-screen display.

Information Dimensions Inc.
     In the last 6 months, Information Dimensions, Inc has released
     commercial versions of its SGMLServer on eight operating systems (VMS,
     Windows and most of the large UNIX systems).  Integrations with
     several of the leading SGML author/editing/viewing systems have been
     prototyped.  IDI has also prototyped a sample integration with its own
     product, BASISDesktop, for access to relational, proprietary
     word-processing and SGML data.

MID/ILG GmbH
     MID has been very active throughout the past year in developing new
     SGML-based products, such as the recently launched SGML export filter,
     Metamorphosis, a new product first introduced at CeBIT'94 in Hannover,
     Germany, that uses common text processing systems (e.g., WordPerfect,
     Microsoft Word, FrameMaker, TeX) as the formatter engine.

Nihon Unitec Company Ltd
     Nihon Unitec, a company providing SGML solutions, is developing an
     SGML parser and editor for the Japanese environment.

OMI Logistics Limited
     OMI Logistics Limited has recently gained two contracts from the UK
     Ministry of Defence for the production of trial Interactive Electronic
     Technical Manuals (IETMs) for two parts of the UK Navy.  The first
     republishes a wide range of material as MILSPEC-compliant IETMS on PC
     notebooks and assesses user reaction.  The second re-publishes a large
     quantity of Fleet Air Arm helicopter manuals.

Open Text Corporation
     Open Text Corporation is launching two byte character support.  The
     product is now truly language independent.  Japanese, Chinese, Korean
     among the languages now benefiting from Open Text's SGML Query
     database.  June is the scheduled release data for PAT 5.0 which Open
     Text describes as by far the most significant to date.  In response to
     a more clearly perceived requirement for SGML databases, Open Text is
     forming partnerships in Belgium, Italy, Sweden, and the UK.

Publishing Development AB
     Publishing Development has announced the release of SGML Companion,
     which provides users with an intuitive non-programming way of
     creating, maintaining and browsing DTDs.

SoftQuad Inc.
     A major highlight of SoftQuad's year was signing an exclusive
     worldwide sales and marketing agreement with Synex Information of
     Sweden.  Synex created the SGML browser and content delivery tool
     called SGML Darc which was previewed at SGML Europe last year and
     which has now been released as SoftQuad Explorer.  In March the
     company shipped SoftQuad Author/Editor 3.0 with industry specific
     templates and support for notations.  Also in March, the company
     announced SoftQuad SGML Enabler for Quark Xpress and received a
     Seybold Award for Excellence honourable mention for it.

STEP -- Stürtz Electronic Publishing GmbH
     In the past year, STEP has added to the number of third party products
     it represents and uses in its system integration projects.  These
     include the ZIFTech DTD Viewer, Microstar's Near & Far, and SoftQuad's
     Author/Editor.  STEP's DocMan, a document management system designed
     for large amounts of data, is also now available.

TechnoTeacher Inc.
     At CALS Expo in November 1993, TechnoTeacher demonstrated their
     MarkMinder and HyMinder Engines.  These products are intended for use
     by persons who are building software applications using C++.
     MarkMinder is an SGML system which was designed specifically to
     support the demands of HyTime.  HyMinder consists of a set of object
     classes that facilitate the development of HyTime-compliant
     applications.

T.I.M.E. LUX
     T.I.M.E. Lux, a European consultancy, are announcing three SGML
     products that will be released this year: an SGML editor using Unicode
     fonts under windows, such as Microsoft, Macintosh, and UNIX; an SGML
     Object Oriented Parser, developed in C++, enabling tree management;
     and an SGML Work Flow Manager under UNIX, Windows, and Macintosh.

US Lynx Inc.
     US Lynx is consulting on a major Army initiative to compile over a
     dozen Army specifications for technical documentation into a single
     standard called MIL-STD-361A.  The modularity-structured DTDs use only
     content-oriented elements at the major hierarchies and incorporate
     HyTime concepts.  The standard is due for release in Fall 1994.


7. EVENTS

Documation'94
     This past February, the first Documation conference was held in Los
     Angeles.  This PTM- and GCA-sponsored conference was the first major
     event to put SGML and document management together.

HyTime/SGML Seminar
March 1994 in Boston & 13-14th October 1994 in Singapore
     In the early 1980's, at the dawn of electronic publishing, the Graphic
     Communications Association sponsored a series of talks by Dr. Charles
     Goldfarb that launched SGML.  Now, a decade later, as electronic
     publishing is metamorphosing into ``integrated open hypermedia'', the
     GCA has arranged for Charles to share his insights into this change by
     presenting a two-day ``HyTime/SGML Seminar''.  A US-based seminar was
     held in Boston in March, and the Asian seminar will be held in
     Singapore in October, in conjunction with SGML Asia '94.

Getting on the Information Highway
6-7th June 1994, Washington Court Hotel, Washington, DC, USA

On-line Publishing Conference
13-16th June 1994, Holiday Inn Crowne Plaza, New York, New York, USA

European HyTime Workshop
5-6th July 1994, Le Meridien Paris Etoile, Paris, France

First International Conference on the Application of HyTime
26-27th July 1994, Westin Bayshore Hotel, Vancouver, British Columbia, Canada
Preceded by a CApH meeting and tutorials

Pharmaceutical Conference
14-15th August 1994, Sheraton Imperial, Research Triangle Park, North
Carolina, USA

SGML Asia '94
10-12th October 1994, Regent Hotel, Singapore

SGML '94
7-10th November 1994, Sheraton Tyson's Corners, Virginia, USA

8. MISCELLANEOUS

SGML Open
     SGML Open, the consortium of vendors and users, has added 3 new
     members in 1994, bringing total membership to 38.  The consortium's
     first Technical Resolution on a common approach to Entity Management,
     has gone to sponsor members for vote.  SGML Open is participating with
     the SGML Users' Group and GCA in a Joint Committee on Conformance
     Testing.  Over the last year, a Board of Directors was elected, a
     full-time Executive Director hired, and marketing/public relations and
     serious technical committee work has begun.  The first Annual General
     Membership Meeting was held in December 1993 and there will be another
     general meeting on Wednesday of this week.

Interest in SGML
     The burgeoning interest in SGML was reflected in the highest
     attendance yet at an SGML conference with 450 persons at SGML'93 in
     Boston last December.  The response has continued in Europe with 250
     attending this conference, 60% above the past peak in May'94, the 10th
     year of the conference.  This growth reflects the many years of hard
     work and heavy investment on the part of GCA.

EPSIG -- Electronic Publishing Special Interest Group
     Graphic Communications Association Research Institute (GCA) has
     assumed the management of EPSIG originally formed by the AAP to foster
     the use of the AAP Electronic Manuscript Project DTDs and, in
     parternship with MacAfee and McAdam, has rejuvenated the EPSIG
     newsletter and training seminars.

European Procurement Handbook on Open Systems
     Version 3 of the European Procurement Handbook on Open Systems (EPHOS)
     will include SGML as part of the guidelines for public procurement.

SGML Project
     Our final item is on a sad note.  The funding for the SGML Project, an
     initiative at Exeter University to raise awareness about SGML in UK
     universities, has come to an end after more than 3 years.  During that
     time, Michael Popham, the project's only full-time worker has
     presented seminars on SGML in most UK universities.  Directed by Paul
     Ellison, the project has run and maintained an anonymous file-server
     (the first with ARCSGML!), and has dealt with SGML queries and
     questions from over 500 organizations - academic and commercial --
     throughout Europe, and as far afield as Australia, Mexico, japan and
     Iceland!  Before the project actually finishes, there will be a
     two-day workshop on ``Document Interchange & Hypermedia'' to which all
     UK Higher Educational establishments have been invited.
     Congratulations to all on a job well done and a major contribution to
     the acceptance of SGML.
</message>
<message id="<Cq272n.IEC@freenet.carleton.ca>" date="2978355406">
Newsgroups: comp.text.sgml
Date: 19 May 1994 16:56:46 UT
From: Gerry Bourguignon \<ad767@freenet.carleton.ca>
Organization: The National Capital FreeNet
Message-ID: \<Cq272n.IEC@freenet.carleton.ca>
Subject: Second edition of Practical SGML

Hi,

I was just wondering what the reaction is to the recently released second
edition of Practical SGML by Eric van Herwijnen.

I am not qualified to comment on it from the perspective of technical
accuracy, but I do think it is a great improvement in terms of presentation
over the first edition.  I took advantage of the prepublication discount
and certainly feel that I got my moneys worth.

What do others think?  (Assuming some of you already have it in hand).
Perhaps I should point out that it is the only text on SGML that I've
looked at.  I had borrowed the first edition from a friend.  From reading
bibliographies on SGML and comments elsewhere, it seems that most agree
that Charles Goldfarb's SGML Handbook is the SGML "bible".  However, I have
also heard that it is very technical and, therefore, somewhat daunting for
the beginner.

Anyway, that's my two cents worth.

Regards,
--
                                         "...the fault of the sailor--the
Gerry Bourguignon              /|\\       fault of he who asks no questions
ad767@freenet.carleton.ca    /__+__\\     about the cargo he is carrying."
Ottawa, Ontario, Canada    \\====|====/   -Daniel Lanois (Fisherman's Daughter)
</message>
<message id="<2rgi2r$nfb@search01.news.aol.com>" date="2978368027">
Newsgroups: comp.text.sgml
Date: 19 May 1994 20:27:07 UT
From: Eliot Kimber \<drmacro@aol.com>
Message-ID: <2rgi2r$nfb@search01.news.aol.com>
Subject: HyTime Samples

As it happens, I have put a demonstration HyTime program out on the SGML
Archive server (ftp.ifi.uio.no), under pub/SGML/Demo (or rather, Erik put
it out there after selflessly correcting the egregious mangling America
Online perpetrated on the files I sent him).  This Perl script, which I
call HyTest, is an extension of the SGMLS.PL Perl script James Clark
provides with SGMLS.  Just like SGMLS.pl, HyTest is intended to provide a
basic processing framework you can use to quickly build your own
applications.

HyTest is far from complete, or even bug free.  I had to do it on late
nights and weekends when I should have been doing something else
(unfortunately, the idea of providing useful HyTime processing on top of
SGMLS hijacked my brain and wouldn't let go until I'd made it real).
HyTest implements nameloc/nmlist indirect locations, including aggregate
locations and cross-document locations.  HyTest also attempts to validate
all those aspects of HyTime it implements, including validating the HyTime
support declarations.  There are a couple of reportable HyTime errors I
haven't quite gotten to, but it will tell you if your support declarations
match both what HyTest can support and what you actually use in your
document.  It also catches, for example, the use of an aggregate location
for an anchor for which "#AGG" was not specified.

HyTest thus demonstrates agorithms and data structures you can use to
implement these basic parts of HyTime and gives you enough function that
you could build useful applications with it.  For example, if you've
already used SGMLS.PL to build a DTDx-to-HTML filter, you could use HyTest
to add HyTime-links-to-HTML-links transforming as well (treating URLs as
the system identifiers of document entities, naturally).

Calculated and stored, but not yet used, are tree locations for each
element and the record number of each element's start tag record in the
SGMLS output file.  The tree location can be used to implement support for
all the structure locations, and the element record numbers allow random
access of the document data by doing direct seeks into the SGMLS output
file, treating it like an "in core" representation of the parsed SGML
document.

My Perl skills are nascent, so I'm sure there are many inefficiencies in
the code, which I'd be very happy to have corrections for.  The code is
offered as is for educational purposes and my hope is that you'll take it
and extend it for your own purposes, or take the algorithms and use them to
develop more sophisticated programs.

Below is a sample HyTime document that I used to test HyTest.pl.  It
contains some errors to exercise the error reporting functions of HyTest.
I've commented on those below.  Because I haven't yet implemented the
random access feature of HyTest, forward references won't be resolved, so
any location address elements or direct reference targets need to precede
the reference to them, although the DTD for the document does not enforce
that rule.  The document needs an SGML declaration that extends the name
length and turns omittag on, such as the latest Docbook declaration.

\<!-- HyTime module use declarations.  Must be before DOCTYPE
     declaration.  Could be in file that contains SGML declaration
     following close of SGML declaration itself.
 -->
\<?HyTime VERSION "ISO/IEC 10744:1992" HYQCNT=32 >
\<?HyTime MODULE base exidrefs dcnatts>
\<!-- Base module.  Exidrefs are cross-document references.  DCNAtts
     is data content notation atts, e.g., the all-dcn attributes
     used for data entities.  This dtd declares the notation SGML
     which uses the HyTime data attribute value "document".  See
     Catalog of Architectural Forms for details.
 -->
\<?HyTime MODULE measure garbage markfun>
\<!-- Measurement module.  Both options are errors of different
     sorts.   "garbage" is not a valid option at all, while
     markfun (marker functions) is not supported by HyTest.pl.
-->
\<?HyTime MODULE locs multloc anysgml anydtd mixspace coordloc>
\<!-- Location address module.  Key option here is "multloc", which
     allows multiple locations, which this document uses and
     HyTest supports.
  -->
\<?HyTime MODULE links >
\<!-- Hyperlinking module.  No relevant options for this document -->
\<!DOCTYPE HYTEST [
  \<!ENTITY % phrases  "#PCDATA | Phrase | NavLink | Citation"
  >
  \<!ENTITY % locationaddress  "nameloc | treeloc | dataloc"
  >
  \<!ENTITY % ilinks  "semioticlink"
  >
  \<!ELEMENT HyTest - - (Title, P+)
                                 +(%locationaddress; | %ilinks;)
  >
  \<!ATTLIST HyTest
     ID        ID    #IMPLIED
     HyTime  NAME #FIXED "HyDoc"
  >
  \<!ELEMENT Title  - O (%phrases;)* >
  \<!ATTLIST Title
     ID        ID    #IMPLIED
     HyTime  NAME #FIXED "HyBrid"
  >

  \<!ELEMENT P    - O (%phrases;)*
  >
  \<!ATTLIST P
     ID        ID    #IMPLIED
     HyTime  NAME #FIXED "HyBrid"
  >
  \<!ELEMENT Phrase    - O (%phrases;)*
  >
  \<!ATTLIST Phrase
     ID        ID    #IMPLIED
     Style     (norm | inherit | italic | bold | smallcaps | bolditalic) inherit
     HyTime  NAME #FIXED "HyBrid"
  >
  \<!ELEMENT Citation    - O (%phrases;)*
  >
  \<!ATTLIST Citation
     ID        ID    #IMPLIED
     HyTime  NAME #FIXED "HyBrid"
  >
  \<!ELEMENT NavLink  - O (%phrases;)*
  >
  \<!ATTLIST NavLink
     linkend   IDREF  #REQUIRED
     ID        ID      #IMPLIED
     HyTime  NAME   #FIXED "clink"
  >
  \<!ELEMENT SemioticLink - O (P+)*
  >
  \<!ATTLIST SemioticLink
     linkends  IDREFS #REQUIRED
     ID        ID      #IMPLIED
     anchrole  CDATA #FIXED "symbol referent #AGG"
     HyTime  NAME   #FIXED "ilink"
     refrange  CDATA  #FIXED "#ALL X"
  >
  \<!ELEMENT nameloc - O (IDlist | EntityList)*
                        -(nameloc)
  >
  \<!ATTLIST nameloc
     ID        ID      #REQUIRED
     HyTime  NAME   #FIXED "nameloc"
     aggloc    (aggloc | agglink | nagg) nagg
  >
  \<!ELEMENT (IDlist | EntityList) - O (#PCDATA)
  >
  \<!ATTLIST IDlist
     HyTime   NAME   #FIXED "nmlist"
     nametype NAME   #FIXED "element"
     docorsub  ENTITY #IMPLIED
  >
  \<!ATTLIST EntityList
     HyTime   NAME   #FIXED "nmlist"
     nametype NAME   #FIXED "entity"
  >
 \<!NOTATION SGML PUBLIC "ISO 8879:1986//NOTATION SGML//EN">
 \<!ATTLIST  #NOTATION SGML
               HyTime NAME #FIXED "document"
  -- HyTime data attribute indicates that entities with this
     notation are SGML document entities. --
 >

 \<!ENTITY frenchhist PUBLIC "-//DRMACRO//DOCUMENT French History//EN"
                        CDATA SGML
  -- An SGML document entity.  Note that HyTime data attribute is
     not specified because it's fixed for notation "SGML" --
 >
]>
\<HyTest>
 \<title>Being An Exploration of the semiotics of
\<citation>Foucalt's Pendulum\</citation>
by Umberto Eco
 \</title>
 \<nameloc id=frenchhist>\<IDlist docorsub=frenchhist>\</nameloc>
 \<!-- Note: empty nmlist refers to document element of docorsub
            which HyTest reports as "#DOMROOT"
   -->
 \<nameloc id=belbos agglink>\<IDlist>bilbobagins casuabon\</nameloc>
 \<p id=bilbobagins>Bilbo Baggins is a character in the
  \<citation>The Hobbit\</citation>.
 \<p id=belbo>Belbo is a character in Umberto Eco's
  \<citation>Foucault's Pendulum\</citation>.
 \<p id=casaubon>Casaubon is the narrator in \<citation>Foucault's Pendulum\</citation>.
 \<p id=templars>The Knights Templar are at the center of many conspiracy
theories, including that the Templars were responsible for the
\<!-- Simple cross-document hyperlink: -->
\<navlink linkend=frenchhist>French Revolution\</navlink>.
\<!-- Independent link: -->
  \<semioticlink linkends="belbos belbo">
\<!-- See what happens if you switch the order of the
     linkends values to "belbo belbos".  -->
   \<p>There is a deep meaning to this link.  See
also \<navlink linkend=templars>templars\</navlink>.
\<!-- No reason an independent link can't itself contain another
     link, in this case a contextual link.
  -->
  \</semioticlink>
\</HyTest>
--
\<Address HyTime=bibloc>
W. Eliot Kimber (kimber@passage.com) Systems Analyst and HyTime Consultant
Passage Systems, Inc., 9971 Quail Blvd., Suite 903, Austin TX 78758, +1 512 339 1400
465 Fairchild Dr., Suite 201, Mountain View, CA  94043, +1 415 390 0911
\</Address>
</message>
<message id="<2rgjjh$nr3@search01.news.aol.com>" date="2978369585">
Newsgroups: comp.text.sgml
Date: 19 May 1994 20:53:05 UT
From: Eliot Kimber \<drmacro@aol.com>
Message-ID: <2rgjjh$nr3@search01.news.aol.com>
Subject: What Does HyTime Require?

The rules for creating element types that conform to HyTime architectural
form are, in brief:

1.  Element forms shown as required in the HyTime standard must be used.

2.  Element forms that are optional need not be used (unless otherwise
    indicated in normative text).

3.  Attributes shown as #REQUIRED or #FIXED-in-DTD must be used.

4.  Attributes that have a declared default value (#IMPLIED, default
    keywords, or CDATA default values) can be omitted, in which case the
    HyTime processor will provide the defaults defined by the HyTime
    standard.

5.  DTDs can always be more restrictive than the HyTime standard requires.
    For example, if HyTime allows a choice of forms or attribute values, a
    DTD need not allow any that are not otherwise required.  By the same
    token, attributes for which HyTime allows a choice of values can be
    fixed to a single value in a DTD.

You can declare element types whose content models allow element instances
that would not be valid HyTime elements, because validation is done for
document and element instances, not DTDs.  For example, in a descriptive
DTD, you might need to support both a HyTime and non-HyTime way of doing
the same thing.  In this case, it will be up to the document author to
ensure that their document is a valid HyTime document, if that is their
intent.

As a general rule that is implicit in the use of standards, you should
never doing something in an application-defined way if the standard defines
a way to do the same thing.

--
\<Address HyTime=bibloc>
W. Eliot Kimber (kimber@passage.com) Systems Analyst and HyTime Consultant
Passage Systems, Inc., 9971 Quail Blvd., Suite 903, Austin TX 78758, +1 512 339 1400
465 Fairchild Dr., Suite 201, Mountain View, CA  94043, +1 415 390 0911
\</Address>
</message>
<message id="<9405192126.AA31203@source.asset.com>" date="2978371563">
Newsgroups: comp.text.sgml
Date: 19 May 1994 21:26:03 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405192126.AA31203@source.asset.com>
Subject: Xenoforms

As with all brains-of-slug, I have erred.  Earlier I posted to David
M. that I thought xenoforms might be our only way to embed behavior.  As
usual, I have short-changed Charles usual thoroughness.  10744 - 7.4.1.3
Marker Function offers an alternative by which functions can be noted by
formal notation (fixed in DTD) or by application processing of the generic
identifier (switch on tag, sorta).

Mea culpa..mea culpa (readas: len, you ignorant slut).

Len Bullard
</message>
<message id="<1994May20.033714.11719@let.rug.nl>" date="2978393834">
Newsgroups: comp.text.sgml
Date: 20 May 1994 03:37:14 UT
From: Harry Gaylord \<galiard@let.rug.nl>
Organization: Faculteit der Letteren, Rijksuniversiteit Groningen, NL
Message-ID: <1994May20.033714.11719@let.rug.nl>
References: <1994May16.145935.9648@xgml.com> <9405172030.AA37387@source.asset.com> <2rfll4INNedp@oasys.dt.navy.mil>
Subject: Re: Tool Limitations

I am not sure we are so limited.  Here we have access to Author/Editor and
Markit/Writeit for editing in an SGML context in addition two a couple of
emacs versions which also help.  If we don't have an SGML application for
some purpose, we use Omnimark to write a transducer to a piece of non SGML,
like Latex -> dvips -> ps for typesetting or tact for text analysis.  We
maintain one form of the text in conforming SGML.  We have access to some
texts in the dynabook browser, but not the dynabook publisher yet.  Of
course, this is expensive, but for academic developers of SGML most of the
SGML companies give significant reduction on prices.

In the meantime we can concentrate on building new integrated tools or
applications based on these systems for new purposes based on SGML, HyTime,
and the forthcoming DSSSL.

Harry Gaylord
</message>
<message id="<2rhbkf$sn@crl2.crl.com>" date="2978394191">
Newsgroups: comp.text.sgml
Date: 20 May 1994 03:43:11 UT
From: Steve Zimmerman \<stevezim@crl.com>
Message-ID: <2rhbkf$sn@crl2.crl.com>
Subject: Selling SGML Capability

I have been a lurker to this newsgroup for about 6 months because of a
special interest in SGML which I'd like to discuss in this post & ask for
comments.

Although I have purchased & am studying the 2nd edition of "Practical
SGML", my interest is not specifically in learning to use the standard to
create documents.

I am familiar enough with SGML to understand its great potential.  To put
it more plainly, I (and I assume most of you posting to this group, perhaps
to varying degrees) am sold on SGML and its approach as the wave of the
future.  The weekly magazine "U.S. News & World Report" recently identified
SGML as one of 25 office technologies of the future.

My background is in marketing and selling.  For the past 19 years, I have
run my own export trading company, specialising in electronic components,
mostly semiconductors.  This business is maturing to the extent there is
too much competition and the margins of 10-20 years ago aren't available
anymore.

Now to get to the point of this post: I want to find a way to get involved
in selling SGML capability to small & medium sized firms who need it but
may not even know it.  I think there are two complementary business areas:

1.  Acting as a consultant by recommending or selling software, hardware
    and knowhow to get a firm started in SGML.

2.  Acting as a service bureau to do SGML work either as a bridge for firms
    just starting out or overflow work for firms involved with SGML that
    have more work than people to do it.

I would now like to invite comments, suggestions, criticisms, flames or
anything else that might add to my knowledge.

I realise this newsgroup is primarily for the purpose of exchanging
technical information and the subject matter of this post may not be
appropriate.

If that is the case, please email me directly at \<stevezim@crl.com> or
contact me at my office shown below.

--
Steve Zimmerman, President
Teksel Inc.
4310 N. 75th St.
Scottsdale, AZ 85251
Phone: 602-994-8224
Fax:   602-994-1955
</message>
<message id="<2rgk78$5va@felix.dircon.co.uk>" date="2978395060">
Newsgroups: comp.text.sgml
Date: 20 May 1994 03:57:40 UT
From: Bruce Hunter \<bruce@sgml.dircon.co.uk>
Message-ID: <2rgk78$5va@felix.dircon.co.uk>
References: \<Cpy5s7.1ww@unx.sas.com> \<DMEGGINS.94May18122619@aix1.uottawa.ca> <9405181849.AA58090@source.asset.com>
Subject: Re: Defining a Display in HTML

[Len Bullard]

[talking about IADS, I think]

|   If he requires a more sophisticated form, a commercial version with
|   more features and some HyTime support is available.


Really?  I didn't know this.  Do you know where more info can be obtained
on this?

regards,
--
	Bruce Hunter
	SGML Systems Engineering
	bruce@sgml.dircon.co.uk
</message>
<message id="<1994May20.091202.19026@leeds.ac.uk>" date="2978413922">
Newsgroups: comp.text.sgml
Date: 20 May 1994 09:12:02 UT
From: N F Drakos \<nikos@cbl.leeds.ac.uk>
Message-ID: <1994May20.091202.19026@leeds.ac.uk>
References: \<Cpy5s7.1ww@unx.sas.com>
Subject: Re: Defining a Display in HTML

[Mark Britt]

[textual explanation of the picture below deleted]

|   \<!-- ###################### EXAMPLE ########################### -->
|   \<!--      DISPLAY                      CODE                     -->
|
|   +----------------------------+         \<HTML>
|   |                            |         \<head>
|   |  In this section of the    |         \<title>Tutorial\</title>
|   |  tutorial, you learn how   |         \</head>
|   |  to:                       |         \<body>
|   |                            |         \<display>
|   |  > blah blah blah          |         \<p>
|   |                            |         In this section of the
|   |                            |         tutorial, you learn how
|   |   \<Continue>  \<Back>       |         to:
|   +----------------------------+         \<ul>
|   +----------------------------+         \<li>blah blah blah
|   |                            |         \</ul>
|   |  In this section of the    |         \</display>
|   |  tutorial, you learn how   |         \<display>
|   |  to:                       |         \<p>
|   |                            |         In this section of the
|   |  > blah blah blah          |         tutorial, you learn how
|   |  > blee blee blee          |         to:
|   |                            |         \<ul>
|   |   \<Continue>  \<Back>       |         \<li>blah blah blah
|   +----------------------------+         \<li>blee blee blee
|   +----------------------------+         \</ul>
|   |                            |         \</display>
|   |  In this section of the    |         \<display>
|   |  tutorial, you learn how   |         \<p>
|   |  to:                       |         In this section of the
|   |                            |         tutorial, you learn how
|   |  > blah blah blah          |         to:
|   |  > blee blee blee          |         \<ul>
|   |  > blow blow blow          |         \<li>blah blah blah
|   |                            |         \<li>blee blee blee
|   |   \<Continue>  \<Back>       |         \<li>blow blow blow
|   +----------------------------+         \</ul>
|                                          \</display>
|                                          \</body>
|                                          \</HTML>
|
|   \<!-- #################### END OF EXAMPLE ######################## -->

If you work with raw HTML the usual way of doing this is by splitting each
"display" into a separate file and then manually coding the \<continue> and
\<back> buttons yourself as hypertext links to the appropriate file.

Another alternative is to use a conversion tool that can read a document in
a different format and create several HTML files complete with navigation
buttons.  LaTeX2HTML certainly does this very easily and this is one of the
reasons it is so popular.  Webmaker (frame to HTML) provides similar
functionality.

Tools for "fragmenting" documents written in HTML in this way will start
appearing as soon as enough people ask for it.  The difficulties you are
having in doing what you want do not reflect limitations in HTML as it has
been suggested but rather limitations in the tools currently available to
create and manipulate HTML documents.

To demonstrate that what you want can be done with HTML have a look at
http://cbl.leeds.ac.uk/nikos/tmp/test/test.html which contains the
generated HTML version of the LaTeX document below (the table of contents
and a printed copy come for free..).

\\documentstyle{article}

\\begin{document}
\\title{Tutorial on Web Making}
\\author{Author Name Here}
\\maketitle
\\tableofcontents
\\section{1}
In this section of the
tutorial, you learn how to:
\\begin{itemize}
\\item blah blah blah

\\end{itemize}

\\section{2}
In this section of the
tutorial, you learn how to:
\\begin{itemize}
\\item blah blah blah
\\item blee blee blee
\\end{itemize}

\\section{3}
In this section of the
tutorial, you learn how to:
\\begin{itemize}
\\item blah blah blah
\\item blee blee blee
\\item blow blow blow
\\end{itemize}
\\end{document}

Nikos.

--
Nikos Drakos			
Computer Based Learning Unit   	nikos@cbl.leeds.ac.uk
University of Leeds		http://cbl.leeds.ac.uk/nikos/personal.html
</message>
<message id="<9405201247.AA14536@source.asset.com>" date="2978426829">
Newsgroups: comp.text.sgml
Date: 20 May 1994 12:47:09 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405201247.AA14536@source.asset.com>
Subject: Re: Defining a Display in HTML

[Bruce Hunter]

|   Do you know where more info can be obtained on this?

Information on IDE/AS can be obtained from:

    William Schultz
    Unisys Corporation
    4000 South Memorial Parkway
    Huntsville, Alabama  35802

Len Bullard
</message>
<message id="<2riei2INNk7q@oasys.dt.navy.mil>" date="2978429953">
Newsgroups: comp.text.sgml
Date: 20 May 1994 13:39:13 UT
From: Betty Harvey \<harvey@oasys.dt.navy.mil>
Organization: Carderock Division, NSWC, Bethesda, MD
Message-ID: <2riei2INNk7q@oasys.dt.navy.mil>
References: <1994May16.145935.9648@xgml.com> <1994May20.033714.11719@let.rug.nl>
Subject: Re: Tool Limitations

[Harry Gaylord]

|   I am not sure we are so limited.  Here we have access to Author/Editor
|   and Markit/Writeit for editing in an SGML context in addition two a
|   couple of emacs versions which also help.  If we don't have an SGML
|   application for some purpose, we use Omnimark to write a transducer to
|   a piece of non SGML, like Latex -> dvips -> ps for typesetting or tact
|   for text analysis.  We maintain one form of the text in conforming
|   SGML.  We have access to some texts in the dynabook browser, but not
|   the dynabook publisher yet.  Of course, this is expensive, but for
|   academic developers of SGML most of the SGML companies give significant
|   reduction on prices.

Have you found a tool yet that is all-encompassing?  I don't think so.  I
haven't found an SGML editor yet (and I have tried many) that I can
whole-heartedly endorse without putting certain caveats.  Also, it is my
personal feeling (so please don't flame me) that the author/editor of an
SGML instance (document) shouldn't need to be SGML savvy.  The current
tools do not allowed 'the SGML unwashed' to sit down and start authoring
documents.  The translation of the DTD into a machine readable proprietary
format can be (and often is) a gruelling task.

|   In the meantime we can concentrate on building new integrated tools or
|   applications based on these systems for new purposes based on SGML,
|   HyTime, and the forthcoming DSSSL.

How we output SGML is currently our most difficult problem.  FOSI's were
supposed to allow interchange of formatting information between different
composition systems but they have failed for various reasons.  DSSSL
might be the solution but from my understanding it will be at least two
years before the standard is finished.

--
Betty Harvey  \<harvey@oasys.dt.navy.mil>     | David Taylor Model Basin
Advanced Information Systems Branch          | Carderock Division
Code 183                                     | Naval Surface Warfare
Bethesda, Md.  20084-5000                    |   Center
(301)227-3348   FAX (301)227-3343            | DTMB,CD,NSWC
</message>
<message id="<Cq3tn0.n4o@csc.ti.com>" date="2978431306">
Newsgroups: comp.text.sgml
Date: 20 May 1994 14:01:46 UT
From: Philip Thrift \<thrift@ra.csc.ti.com>
Organization: Texas Instruments
Message-ID: \<Cq3tn0.n4o@csc.ti.com>
Subject: Embedding LaTeX in HTML

There has been some discussion about including math and table data, which
LaTeX is good for, in HTML files (a problem until MATH and TABLE containers
are generally supported.  Below is a solution I use that some may find
useful (I have).  There is of course the latex2html package at

 http://cbl.leeds.ac.uk/nikos/tex2html/doc/latex2html/latex2html.html

and the method below uses pstogif, pstoppm.ps, pstoxbm from that package.
The approach I use below is sort of the opposite of latex2html: embed LaTeX
in HTML and translate.  Pending approval, I could probably release the
software, but the filter I wrote is simple, so I hope this explanation is
sufficient for now.  I am not aware of anyone else using this approach, so
I would be interested in any pointer to such.

The solution I use is to embed LaTeX data in HTML comments.  Anything in
\<!--#tex ... --> is considered LaTeX data.  This is similar to the way
'server side includes' are done (syntactically) in the latest WWW protocol.

Below is an example of an HTMX file (containing LaTeX in HTML comments):
example.htmx:
------------------------------------------------------------------
\<TITLE>Example of embedding LaTex in an HTML file\</TITLE>

\<P>Here is an example of a table:
\<P ALIGN=center>\<A NAME=table1>
\<!--#tex
    \\begin{tabular}{|l|l|r|} \\hline\\hline
    {\\em type} &
     \\multicolumn{2}{c|}{\\em style} \\\\ \\hline\\hline
     smart    & red  & short \\\\
     rather silly & blue & tall \\\\ \\hline\\hline
    \\end{tabular}
-->\</A>
\<P>Euler's equation looks like:
\<P ALIGN=center>\<A NAME=euler>
\<!--#tex
    \\[ e^{i\\pi} + 1 = 0  \\]
-->\</A>
\<P>where \<!--#tex \\(e\\)--> is the natural logarithm base
and \<!--#tex \\(i = \\sqrt{-1}\\)-->.
------------------------------------------------------------------

I wrote a filter 'axtex' that extracts the LaTeX "objects" from HTMX files
and produces a .tex and ultimately an image file for each one, and produces
an HTML file with the corresponding inlines:

example.html:
-------------------------------------------------------------------------
\<TITLE>Example of embedding LaTex in an HTML file\</TITLE>

\<P>Here is an example of a table:
\<P ALIGN=center>\<A NAME=table1>
\<IMG SRC="example_01.xbm">\</A>
\<P>Euler's equation looks like:
\<P ALIGN=center>\<A NAME=euler>
\<IMG SRC="example_02.xbm">\</A>
\<P>where \<IMG SRC="example_03.xbm"> is the natural logarithm base
and \<IMG SRC="example_04.xbm">.
-------------------------------------------------------------------------

The XBM files (or GIF could have been generated) are created by the 'axtex'
filter using the latex,dvips, and pstoxbm (or pstogif) utilities.

This approach is from the point of view of an HTML author who wants to
escape to LaTeX 'as needed'.

-pt
</message>
<message id="<Cq3z5F.K3t@brunel.ac.uk>" date="2978438444">
Newsgroups: uk.events,alt.hypertext,comp.text.sgml,comp.multimedia
Date: 20 May 1994 16:00:44 UT
From: Andrew Findlay \<Andrew.Findlay@brunel.ac.uk>
Organization: Brunel University, Uxbridge, UK
Message-ID: \<Cq3z5F.K3t@brunel.ac.uk>
Keywords: document sgml hypermedia hypertext text encoding database markup
Subject: Colloquium: Adding value to documents using markup languages

Adding Value to Documents using Markup Languages
================================================

IEE Colloquium June 6th 1994 at Savoy Place, London WC2


Information is a very valuable resource.  Much of any organisation's
information is in the form of documents: reports, proposals, operating
procedures, memos and papers.  Although documents are often created using
computers, the information they contain is not structured to allow easy
retrieval at a later date.

This colloquium will show how to add great value to documents, by applying
markup so that the individual parts of every document can be stored and
processed with database techniques.  Case studies from engineering, the oil
industry and publishing will demonstrate the benefits already being gained
from SGML and similar systems.


PROVISIONAL PROGRAMME

     10.00 am  Registration and coffee

     Chairman: A J Findlay (Brunel University)

1    10.30     Introduction to SGML concepts:
          P Ellison and M Popham (University of Exeter)

2    11.00     SGML the management language:
          P Lucas (Database Publishing Systems)

3    11.30     Document markup for open information interchange:
          M Bryan (SGML Centre)

4    12.00     The text encoding initiative:
          L Burnard (University of Oxford)

     12.30     LUNCH

5    2.00 Conversion of documents to and from SGML:
          B Hunter (SGML Systems Engineering)

6    2.30 Using SGML to produce reusable technical documentation:
          J J L Chelsom and M Pengelly (CSW Informatics and NAG)

7    3.00 Architectural choices for structured document repositories:
          T Bray (Opentext, Canada)

     3.30 TEA

     3.45 Discussion

     4.15 CLOSE

The IEE is not, as a body, responsible for the views or opinions expressed
by individual authors or speakers.

SGML = Standard Generalized Markup Language


Costs are:

	IEE Members:					 42.00 pounds
	IEE Retired, Unemployed, Student members:	 NO CHARGE
	Non-members:					 70.00 pounds
	Retired, Unemployed, Student non-members:	 21.00 pounds

	Lunch:						  4.70 pounds

Members of IEEIE and Eurel Member Associations will be admitted at the
members rate.

All prices above include VAT

Please note that people wishing to attend MUST complete an application form
so they should contact Kate Davis at the IEE on +44 71 240 1871 Ext: 2206
or email (providing their postal address) kdavis@iee.org.uk

--

----------------------------------------------------------------------------
|      From Andrew Findlay at Brunel University, Uxbridge, UB8 3PH, UK     |
| Andrew.Findlay@brunel.ac.uk       +44 895 203066 or +44 895 274000 x2512 |
----------------------------------------------------------------------------
</message>
<message id="<9405201654.AA31446@source.asset.com>" date="2978441678">
Newsgroups: comp.text.sgml
Date: 20 May 1994 16:54:38 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405201654.AA31446@source.asset.com>
References: <1994May16.145935.9648@xgml.com> <1994May20.033714.11719@let.rug.nl> <2riei2INNk7q@oasys.dt.navy.mil>
Subject: Re: Tool Limitations

A few years ago, the Context Corporation of Mentor created a very nice
WYSIWYG style interface for basic SGML (did not support entities).  It fell
down as most of these things do when they tried to hardwire aspects of the
DTD.  That is, if it could not be expressed with their objects (lists,
paragraphs, etc) it could not be expressed.  Eventually, they despaired of
the idea and some very excellent talent (Charlie Sorgi, Greg LLoyd, etc)
went their own way and are probably doing excellent work elsewhere.  Of
course, Charlie used to refer to us as the "SGML Nazis" so maybe "SGMList"
ain't too bad.

It seems that everyone relies on translation of SGML for almost everything.
This makes event processing something of an awkward task and is a major
headache for third-level systems.

For object-oriented systems to do the kinds of things that do not hardwire
the DTD into the objects, an object-oriented system must be more SGML-aware
at the level of the standard (syntax, not semantics) and more adept at
reasoning how the semantics expressed in the DTD can be adapted to the
processing required.  For instance, it is not enough to specify that
"reuse" is a goal of a DTD design.  Abstract Data Types (of which a DTD is
a subset defined by 8879), always require both a type definition and a set
of processes (functions/methods/whatever) that can be against the data
types.  SGMLists can always define the data type, but the functions are
left to the discretion of the application, and that is where the disconnect
begins.  SGML makes it possible to notate these functions in several ways,
but like all data-driven systems, does not provide a way to encode them.

Perhaps DSSSL will provide a standard means for this kind of description in
a standard way.  HyTime provides a way to specify which functions should be
specified and that is something we are working on very hard... but, to
prevent the kind of features mismatches described by many who use SGML, we
may have to describe this as classes of functions that any system offered
for a certain class of mission must provably demonstrate.  If we cannot do
this, and soon, it is very likely that the services will finally pick one
application and "go with it" until something better goes along.  To be
sure, a lot of people won't be happy with the choice.

Not SGML savvy?  You are right at the level that one shouldn't need to know
how an internal combustion engine runs to drive a car.  A view dimension is
always defined as the unit of resolution of address required to make a
given decision at a given time in a given environment.  However, if I want
to do things more cheaply, it always nice to know how to change the oil and
fill the gas tank.  Some applications let you see the innards, and some
have "no user serviceable parts."  Near and Far is a nice application in
that it allows one to create in a cooperative way, a description that
becomes a DTD.  So it lets us concentrate on the cultural issues which are
usually more difficult.  Is this new?  No.  It is an application of CASE
technology that has been around for years.  What was the experience with
CASE tools?  They are useful to:

1.  Standardize methods and documentation.
2.  They relieve the tedium of repetitive work.
3.  If they are methodology dependent (roughly the equivalent of hardwiring
    the DTD) they can only be applied to the method for which which they
    were designed.
4.  If they are method-independent (i.e, 4th generation CASE tool), an
    application designer must configure the system for the task at hand.

Such tools allow us to control complexity and reduce the burden of certain
tasks.  Do they alleviate the necessity to learn to "program"?  Certainly
not.  If a user of a CASE tool can't program at a basic level, they will
not be able to handle the tools just as you shouldn't have a car if you
can't drive safely.

A further benefit of Near and Far is that it also allows us to "explore"
complexity.  one can look at the trees graphically and after awhile, apply
metrics that allow us to get a feel for how complex a DTD enables an
instance to become.  A future use of such tools will be to have metrics
that allow us to calculate the cost of artifacts such as an 87269 type
instance, and the efficiency of the instance (e.g., how many dialogs does
it take to change a light bulb).

What they cannot do is remove the need to understand SGML and how it is
used to design grammars.  One still must understand such notions as #PCDATA
and why it might be preferred to CDATA in a model.  like CASE tools which
are regularly rejected by programmers, its use must be an institutional
requirement.

Quality is in the depth of understanding of the relationship between
environment and need.  How well a system adapts to the environment by
modifying to meet its needs is the engine of evolution of the system.  As
Erik and others have cautioned us, don't mistake a fancy interface for a
quality product.  Near and Far is a quality product.  But it isn't always
needed and if required, should be applied by those whose level of
experience in design matches their need for quality.

Len Bullard
</message>
<message id="<JH.94May20203432@poseidon.Sietec.de>" date="2978447672">
Newsgroups: comp.text.sgml
Date: 20 May 1994 18:34:32 UT
From: Jochen Roger Hayek \<jh@poseidon.Sietec.de>
Organization: Sietec Systemtechnik, Berlin
Message-ID: \<JH.94May20203432@poseidon.Sietec.de>
Subject: what about sgml2{tex,roff}?

Are there any converters from SGML to TeX or roff, possibly using ASP
(Amsterdam SGML parser) "replacement files", so I can take sgmlsasp for it?

Where can I find appropriate files?

TIA
--
Jochen.Roger.Hayek@Sietec.de		 		// company
Jochen.Roger.Hayek@ACM.org, 100330.424@CompuServe.com	// private
</message>
<message id="<2rj2l1$ld6@dancer.cc.bellcore.com>" date="2978450529">
Newsgroups: comp.text.sgml
Date: 20 May 1994 19:22:09 UT
From: Haim Kilov \<haim@dancer.cc.bellcore.com>
Organization: Bell Communications Research (Bellcore)
Message-ID: <2rj2l1$ld6@dancer.cc.bellcore.com>
References: <24916@dog.cme.nist.gov> <19940515.2234@naggum.no> <2raob4$2ri@dancer.cc.bellcore.com> <19940519.2304@naggum.no>
Subject: Re: nesting entities

[Haim Kilov]

|   Exactly!  E.W.Dijkstra in his 1976 book "A discipline of programming"
|   (and perhaps even earlier than that) stated that at the beginning of
|   programming the goal of our programs was to instruct our machines, but
|   "now" we understood that the goal of our machines is to execute our
|   programs.

[Erik Naggum] 

|   not sure I got the connection here.  did you mean that we have
|   progressed from the state where the goal of our information was to
|   please programs, to the state where the goal of our programs is to
|   understand our information?  if so, I agree we have made the step
|   regarding machines and programs, but we are still a long way from
|   making a similiar step regarding programs and information.

Yes, I did (mean that...).  Thanks for your long message.

There exists a difference (described in our recent book on information
modeling) between, e.g., a user of a bank in 19th century when there were
no computers and when decisions have been made "on the fly" by competent
people (bankers), and a user of a bank now when decisions are
pre-programmed and are not made "on the fly" by competent people.  In 19th
century there was no pressing need to formulate a precise specification
because the bankers "knew" it, and they were the only people with a need to
know.  Now, the bankers don't deal with bank transactions on the fly, but
in order to write programs that will deal with these transactions, a
programmer needs a precise specification of what the program has to do.
Obviously, the banker has no slightest interest in the computer system's
restrictions, etc.  If these specifications are not provided by the banker
then the programmer will invent them.

The same with our information.  It has to be organized not in order "to
please programs": for example, the "need" to type in a 20-digit code is
justified by "pleasing programs" (and also lettering within boxes, and also
-- horror of horrors -- angle brackets).  So, both the programmer -- better
to say, "analyst", "modeler", or "specifier" -- and the user have to
understand that a precise specification is essential, and that it has to be
formulated in business terms rather than in programming terms.

-Haim Kilov
haim@cc.bellcore.com
</message>
<message id="<truly.86.000C719B@lunemere.com>" date="2978450794">
Newsgroups: comp.text.sgml
Date: 20 May 1994 19:26:34 UT
From: Truly Donovan \<truly@lunemere.com>
Organization: La Lunemere
Message-ID: \<truly.86.000C719B@lunemere.com>
References: \<Cq0MJ5.3JJ@ceco.ceco.com>
Keywords: Script SGML GML HTML conversion
Subject: Re: Script -> HTML ?

[Garry J. Garrett]

|  I have some documents in "Script", an IBM Mainframe product that uses a
|  mark up language called GML "Generalized Markup Language".  I think it
|  is similar, if not related to, SGML which seems to be talked about a lot
|  in this newsgroup.  I seem to hear a lot of people talking about
|  converting SGML to HTML (HyperText Markup Language).  Does anyone know
|  of any utilities to convert Script GML documents into HTML documents?
|  If you know of anything to convert SGML to HTML, I'll try that and see
|  if it works.  Thanks for any assistance that you can give.  I would
|  prefer e-mail as I am not sure that I will be able to keep up with this
|  group in the coming weeks.

GML predates the SGML standard, and lacks the formal disciplines of SGML.
It is also limited by the implementation base, IBM Document Composition
Facility (aka SCRIPT).  This more or less accounts for some of its
idiosyncracies relative to SGML.  Nonetheless, it was a test bed for many
of the concepts and principles that are now embodied in the standard.

There are several things you need to be cautious about in converting SCRIPT
GML to anything:

1.  The parsing rules are such that you cannot correctly parse all
    documents without knowing the processing rules (buried in the
    application code) for each tag.  In particular, markup content
    separators are optional in some cases, required in others, and the
    determination of what constitutes an attribute can be tricky.

2.  SCRIPT doesn't care where it gets its orders from, so a document can
    and often does contain an intertwingling of tags and "control words"
    (SCRIPT processing instructions), and the control words can define new
    tags on the fly, redefine the processing, what-have-you.  I often speak
    of it as the only applications environment where the data can wrest
    control from the program and run amok.  Therefore, a conversion that
    deals only with the tags and not the control words is likely to require
    much manual post-processing.  Your documents may or may not have this
    problem.  In my experience, most of them do; for a long time, it was a
    "feature," not a curse.

3.  Although similar in many ways to entities, SCRIPT symbols can be
    defined (and redefined) anywhere in the document.

Good luck,
--
Truly Donovan
publishing consultant
La Lunemere
truly@lunemere.com
</message>
<message id="<DMEGGINS.94May20153159@aix1.uottawa.ca>" date="2978451117">
Newsgroups: comp.text.sgml
Date: 20 May 1994 19:31:57 UT
From: David Megginson \<dmeggins@aix1.uottawa.ca>
Organization: Department of English, University of Ottawa
Message-ID: \<DMEGGINS.94May20153159@aix1.uottawa.ca>
References: \<Cq3tn0.n4o@csc.ti.com>
Subject: Re: Embedding LaTeX in HTML

[Philip Thrift]

|   The solution I use is to embed LaTeX data in HTML comments.  Anything
|   in \<!--#tex ... --> is considered LaTeX data.  This is similar to the
|   way 'server side includes' are done (syntactically) in the latest WWW
|   protocol.

I'm not certain that putting this information in a comment is the best
idea.  Why not put it in marked sections or, better yet, in external files?
SGML is _designed_ to handle this sort of thing.  I don't think that there
is a registered public identifier yet for a LaTeX notation, but I remember
seeing some good suggestions posted to this list recently.

This would break your system.

  Here is some PCDATA \<!--#tex Here is some LaTeX code ----
  interesting, eh? --> and here is some more PCDATA.

David
--
David Megginson                Department of English, University of Ottawa,
dmeggins@aix1.uottawa.ca       Ottawa, Ontario, CANADA  K1N 6N5
dmeggins@acadvm1.uottawa.ca    Phone: (613) 564-6850 (Office)
ak117@freenet.carleton.ca             (613) 564-9175 (FAX)
</message>
<message id="<memo.169421@cix.compulink.co.uk>" date="2978451873">
Newsgroups: comp.text.sgml
Date: 20 May 1994 19:44:33 UT
From: Nick Barron \<nikb@cix.compulink.co.uk>
Message-ID: \<memo.169421@cix.compulink.co.uk>
Subject: ATA-100 & J2008

I'm after any information which is available over the net on the SGML parts
of ATA100 and J2008.  Any pointers most welcome.  Feel free to reply by
mail if you feel posting will waste bandwidth.

Thanks in advance.

Nik
</message>
<message id="<Cq4C4G.A06@unx.sas.com>" date="2978455263">
Newsgroups: comp.text.sgml
Date: 20 May 1994 20:41:03 UT
From: John Barnum \<sasbar@kanga.unx.sas.com>
Organization: SAS Institute Inc.
Message-ID: \<Cq4C4G.A06@unx.sas.com>
Keywords: double byte kanji
Subject: Double byte character sets

Can anyone tell me where I can find information about using KANJI with SGML
or point me in the direction of folks who are translating SGML documents
from English to Japanese using an SGML editor?

Thanks.
--
John Barnum        | SAS Institute Inc.
sasbar@unx.sas.com | +1 919 677 8000 x6068
</message>
<message id="<1994May20.151629.1034@vaxb.phx1.aro.allied.com>" date="2978460989">
Newsgroups: comp.text.sgml
Date: 20 May 1994 22:16:29 UT
From: Name Unknown \<b98429@vaxb.phx1.aro.allied.com>
Organization: AlliedSignal Engines
Message-ID: <1994May20.151629.1034@vaxb.phx1.aro.allied.com>
Subject: RE:Second Edition of Practical SGML

HI:

The second edition of Practical SGML by Eric Van Herwijnen was tremendously
valuable to me.  Being a novice at DTD creation and only an avid user of SGML
for documentation creation, the book covered all of the basic relatively
clearly.

I still felt at times that you had to have a programming backround to step into
creating a DTD.  I guess, I was expecting more of a how to type, (classic
textbook [workbook]) approach leading up to writing a so to speak sample DTD
and instance.  The memo structure, he used in his book, would have been a good
step by step, instructional tool.

The book was truely helpful.   Does anyone know of a simple step by step
instructional aide available for DTD creation?

Would really help novices like myself.
</message>
<message id="<Cq5Jy0.Mx@csc.ti.com>" date="2978512056">
Newsgroups: comp.text.sgml
Date: 21 May 1994 12:27:36 UT
From: Philip Thrift \<thrift@ra.csc.ti.com>
Organization: Texas Instruments
Message-ID: \<Cq5Jy0.Mx@csc.ti.com>
References: \<Cq3tn0.n4o@csc.ti.com> \<DMEGGINS.94May20153159@aix1.uottawa.ca>
Subject: Re: Embedding LaTeX in HTML

[David Megginson]

|   I'm not certain that putting this information in a comment is the best
|   idea.  Why not put it in marked sections or, better yet, in external
|   files?  SGML is _designed_ to handle this sort of thing.  I don't think
|   that there is a registered public identifier yet for a LaTeX notation,
|   but I remember seeing some good suggestions posted to this list
|   recently.
|
|   This would break your system.
|   
|     Here is some PCDATA \<!--#tex Here is some LaTeX code ---
|     interesting, eh? --> and here is some more PCDATA.

Actually this would "break" it:

   \<P>Here is some PCDATA \<!--#tex Here-->is some \\LaTeX code -->

but not your example since the filter looks for begin-comment end-comment
tags.

But I would like to see the better notations.  Putting the LaTeX entities
is separate files seems cumbersome for the HTML author, when the file
contains alot of LaTeX snippits:

   \<P>The suits in a standard deck of cards are \<!--#tex\\(\\clubsuit\\)-->,
   \<!--#tex\\(\\diamondsuit\\)-->,\<!--#tex\\(\\heartsuit\\)-->, and
   \<!--#tex\\(\\spadesuit\\)-->.

But the best way I would like to see.

-pt
</message>
<message id="<2rlb64$pne@search01.news.aol.com>" date="2978524804">
Newsgroups: comp.text.sgml
Date: 21 May 1994 16:00:04 UT
From: Eliot Kimber \<drmacro@aol.com>
Message-ID: <2rlb64$pne@search01.news.aol.com>
References: \<Cq5Jy0.Mx@csc.ti.com>
Subject: Re: Embedding LaTeX in HTML

[Philip Thrift]

|   \<P>The suits in a standard deck of cards are \<!--#tex\\(\\clubsuit\\)-->,
|   \<!--#tex\\(\\diamondsuit\\)-->,\<!--#tex\\(\\heartsuit\\)-->, and
|   \<!--#tex\\(\\spadesuit\\)-->.

Ignoring the non-SGMLness of HTML processing for a moment, there are really
only four proper ways to do the above, and putting it in comment
declarations is not one of them.  The four mechanisms SGML defines are:

1.  Processing instructions, either directly or via PI entities, e.g:

    \<?#tex\\(\\heartsuit\\)>

2.  SDATA entities would probably be more approriate when what's being
    defined is a special character:

    \<!ENTITY heartsuit SDATA "#tex\\(\\heartsuit\\)" >

2.  External data entities, e.g.:

    \<!ENTITY texstuff SYSTEM "mystuff.tex" NDATA TeX>

3.  As CDATA or RCDATA content of an element with a declared notation,
    e.g.,:

    \<!DOCTYPE MyDoc [ ...
    \<!ELEMENT TeXHolder - - CDATA >
    \<!ATTLIST TeXHolder
      notation NOTATION #FIXED "TeX"
    >
    ...
    ]>
    \<mydoc>
    ...
    \<texholder>\\(\\heartsuit\\)\</texholder>

The reason comments are not appropriate is that comments are, by
definition, completely invisible as document data (although some SGML
processing tools, like the newly-announced OmniMark, may provide access to
comments) and can therefore not be relied upon for any processing.  From
what I've seen in the examples so far posted, processing instructions are
the best way to encode the above.  Using SDATA entities for special
characters is probably the better approach from an SGML purity standpoint,
but it would require some sort of API for interpreting and resolving SDATA
entity references, while PIs are more direct (for example, with SGMLS, to
process SDATA entities, you'd have to build a symbol table from SDATA
entity records and then process SDATA entity references to resolve their
values from the SDATA table you've built, while processing PIs is a simple
direct call to the PI processing subroutine already provided by SGMLS.pl).
Note that PIs and SDATA entities are both non-portable ways of doing
application-specific stuff and are generally not portable across
applications.  The use of a keyword like "#tex" is a good practice, at
least allowing applications to distinguish PIs and SDATA entities intended
for specific processors.


--
\<Address HyTime=bibloc>
W. Eliot Kimber (kimber@passage.com) Systems Analyst and HyTime Consultant
Passage Systems, Inc., 9971 Quail Blvd., Suite 903, Austin TX 78758, +1 512 339 1400
465 Fairchild Dr., Suite 201, Mountain View, CA  94043, +1 415 390 0911
\</Address>
</message>
<message id="<Cq5zpt.4vn@csc.ti.com>" date="2978532496">
Newsgroups: comp.text.sgml
Date: 21 May 1994 18:08:16 UT
From: Philip Thrift \<thrift@ra.csc.ti.com>
Organization: Texas Instruments
Message-ID: \<Cq5zpt.4vn@csc.ti.com>
References: \<Cq5Jy0.Mx@csc.ti.com> <2rlb64$pne@search01.news.aol.com>
Subject: Re: Embedding LaTeX in HTML

[Philip Thrift]

|   ... \<!--#tex\\(\\clubsuit\\)--> ...

[Eliot Kimber]

|   Ignoring the non-SGMLness of HTML processing for a moment, there are
|   really only four proper ways to do the above, and putting it in comment
|   declarations is not one of them.  The four mechanisms SGML defines are:
|   
|   1. Processing instructions, either directly or via PI entities, e.g:
|      \<?#tex\\(\\heartsuit\\)>> ...
|    ...
|   The reason comments are not appropriate is that comments are, by
|   definition, completely invisible as document data (although some SGML
|   processing tools, like the newly-announced OmniMark, may provide access
|   to comments) and can therefore not be relied upon for any processing.

Thanks for the SGML-correct mechanisms.  I was led (inspired?) by the
current way 'server side includes' are done in HTML:

    e.g. \<!--#exec cmd="/bin/date"-->

introduces the date into the document by the server.

Now I don't know the reasoning behind this.  Does anyone know?

But conformity to SGML is promised in future generations of HTML.
</message>
<message id="<2rmdgc$a0i@news.delphi.com>" date="2978545328">
Newsgroups: comp.text.sgml
Date: 21 May 1994 21:42:08 UT
From: Jeffrey McArthur \<j_mcarthur@BIX.com>
Organization: Galahad
Message-ID: <2rmdgc$a0i@news.delphi.com>
Subject: ANSI/NISO/ISO 12083 Standard

I am looking for a SOFT copy of the proposed ANSI/NISI/ISO Standard 12083.
I have access to a hard copy, but is there a soft copy somewhere on the
net.

--
    Jeffrey M\\kern-.05em\\raise.5ex\\hbox{\\b c}\\kern-.05emArthur
    a.k.a. Jeffrey McArthur          ATLIS Publishing
    phone: +1 301 210 6655           12001 Indian Creek Court
    fax:   +1 301 210 4999           Beltsville, MD  20705
    home:  +1 410 290 6935           email: j_mcarthur@bix.com

The opinions express are mine and mine alone.  They do not reflect the
opinions of ATLIS Publishing or any part of ATLIS Systems.  ATLIS does NOT
pay for my access to the Internet.
</message>
<message id="<DMEGGINS.94May22074722@aix1.uottawa.ca>" date="2978596042">
Newsgroups: comp.text.sgml
Date: 22 May 1994 11:47:22 UT
From: David Megginson \<dmeggins@aix1.uottawa.ca>
Organization: Department of English, University of Ottawa
Message-ID: \<DMEGGINS.94May22074722@aix1.uottawa.ca>
References: \<Cq5Jy0.Mx@csc.ti.com> <2rlb64$pne@search01.news.aol.com> \<Cq5zpt.4vn@csc.ti.com>
Subject: Re: Embedding LaTeX in HTML

[Philip Thrift]

|   Thanks for the SGML-correct mechanisms. I was led (inspired?) by
|   the current way 'server side includes' are done in HTML:
|
|       e.g. \<!--#exec cmd="/bin/date"-->
|
|   introduces the date into the document by the server.

I have not seen this either in the HTML or the draft HTML+ spec -- it must
be a local kludge which someone who didn't really know much about SGML
introduced for an HTML server.  This could be handled quite easily several
ways in an SGML document.

David
--
David Megginson                Department of English, University of Ottawa,
dmeggins@aix1.uottawa.ca       Ottawa, Ontario, CANADA  K1N 6N5
dmeggins@acadvm1.uottawa.ca    Phone: +1 613 564 6850 (Office)
ak117@freenet.carleton.ca             +1 613 564 9175 (FAX)
</message>
<message id="<1994May22.125635.1038@dxcern.cern.ch>" date="2978600195">
Newsgroups: comp.text.sgml
Date: 22 May 1994 12:56:35 UT
From: Eric van Herwijnen \<evh@dxcern.cern.ch>
Organization: CERN European Laboratory for Particle Physics
Message-ID: <1994May22.125635.1038@dxcern.cern.ch>
References: <2rmdgc$a0i@news.delphi.com>
Subject: Re: ANSI/NISO/ISO 12083 Standard

ISO 12083:1994 is scheduled to appear soon. An annotated electronic version
that contains the complete text will be bundled with the standard.

A conversion utility for converting AAP texts to the ISO 12083 DTDs is
being sold by the GCA (AAP2ISO).  This also contains the DTDs.  Phone
Marion Elledge on 703-519-8160 for more info, or email me on
100043.3201@CompuServe.com

Eric van Herwijnen
</message>
<message id="<1994May22.130657.4591@dxcern.cern.ch>" date="2978600817">
Newsgroups: comp.text.sgml
Date: 22 May 1994 13:06:57 UT
From: Eric van Herwijnen \<evh@dxcern.cern.ch>
Organization: CERN European Laboratory for Particle Physics
Message-ID: <1994May22.130657.4591@dxcern.cern.ch>
Subject: SGML editor for Word 6.0

For all the people that have requested our new email address, the enquiries
concerning the SGML editor for Word 6.0 ("The SGML Tag Wizard") should now
be addressed to 100043.3201@Compuserve.Com.  The address at EVH@ALTERN.COM
should no longer be used.
</message>
<message id="<Cq7Hzu.GAC@csc.ti.com>" date="2978602842">
Newsgroups: comp.text.sgml
Date: 22 May 1994 13:40:42 UT
From: Philip Thrift \<thrift@ra.csc.ti.com>
Organization: Texas Instruments
Message-ID: \<Cq7Hzu.GAC@csc.ti.com>
References: <2rlb64$pne@search01.news.aol.com> \<Cq5zpt.4vn@csc.ti.com> \<DMEGGINS.94May22074722@aix1.uottawa.ca>
Subject: Re: Embedding LaTeX in HTML

[Philip Thrift]

|   e.g. \<!--#exec cmd="/bin/date"-->
|   
|   introduces the date into the document by the server.

[David Megginson]

|   I have not seen this either in the HTML or the draft HTML+ spec -- it
|   must be a local kludge which someone who didn't really know much about
|   SGML introduced for an HTML server.  This could be handled quite easily
|   several ways in an SGML document.

This is the way NCSA httpd 1.2 supports this.  Before this version it would
have been something like \<inc srv "|/bin/date">.  Since it is a revision
made by NCSA I would guess it is somewhat "official" and not "local", but
the reasoning for NCSA httpd moving to this syntax is a mystery about which
I remain curious.

-pt
</message>
<message id="<Cq7Krt.Gv7@csc.ti.com>" date="2978606441">
Newsgroups: comp.text.sgml
Date: 22 May 1994 14:40:41 UT
From: Philip Thrift \<thrift@ra.csc.ti.com>
Organization: Texas Instruments
Message-ID: \<Cq7Krt.Gv7@csc.ti.com>
References: <2rlb64$pne@search01.news.aol.com> \<Cq5zpt.4vn@csc.ti.com> \<DMEGGINS.94May22074722@aix1.uottawa.ca>
Subject: Re: Embedding LaTeX in HTML

Regarding the use of \<!--#exec cmd="/bin/date"--> syntax in HTML, this is
part of a page from the NCSA documentation, the complete set can be
found at http://hoohoo.ncsa.uiuc.edu/.

-pt
-----------------------------------------------------------------------

                    [IMAGE] NCSA HTTPD SERVER SIDE INCLUDES

   NCSA httpd allows users to create documents which provide simple
   information to clients on the fly. Such information can include the
   current date, the file's last modification date, and the size or last
   modification of other files. In its more advanced usage, it can
   provide a powerful interface to CGI and /bin/sh programs.


     _________________________________________________________________



Issues

   Having the server parse documents is a double edged sword. It can be
   costly for heavily loaded servers to perform parsing of files while
   sending them. Further, it can be considered a security risk to have
   average users executing commands as the server's User. If you disable
   the exec option, this danger is mitigated, but the performance issue
   remains. You should consider these items carefully before activating
   server-side includes on your server.


     _________________________________________________________________

... [setup omitted]

    _________________________________________________________________
Converting your old INC SRV documents to the new format

   You should use the program inc2shtml in the support subdirectory of
   the httpd distribution to translate your documents from httpd 1.1 and
   earlier to the new format. Usage is simple: inc2shtml file.html >
   file.shtml.


     _________________________________________________________________



The Include format

   All directives to the server are formatted as SGML comments within the
   document. This is in case the document should ever find itself in the
   client's hands unparsed. Each directive has the following format:

   \<!--#command tag1="value1" tag2="value2" -->

   Each command takes different arguments, most only accept one tag at a
   time. Here is a breakdown of the commands and their associated tags:

     * config

       The config directive controls various aspects of the file parsing.
       There are two valid tags:

          + errmsg controls what message is sent back to the client if an
            error includes while parsing the document. When an error
            occurs, it is logged in the server's error log.

          + timefmt gives the server a new format to use when providing
            dates. This is a string compatible with the strftime library
            call under most versions of UNIX.

          + sizefmt determines the formatting to be used when displaying
            the size of a file. Valid choices are bytes, for a formatted
            byte count (formatted as 1,234,567), or abbrev for an
            abbreviated version displaying the number of kilobytes or
            megabytes the file occupies.

     * include

       include will insert the text of a document into the parsed
       document. Any included file is subject to the usual access
       control. This command accepts two tags:

          + virtual gives a virtual path to a document on the server. You
            must access a normal file this way, you cannot access a CGI
            script in this fashion. You can, however, access another
            parsed document.

          + file gives a pathname relative to the current directory. ../
            cannot be used in this pathname, nor can absolute paths be
            used. As above, you can send other parsed documents, but you
            cannot send CGI scripts.

     * echo prints the value of one of the include variables (defined
       below). Any dates are printed subject to the currently configured
       timefmt. The only valid tag to this command is var, whose value is
       the name of the variable you wish to echo.

     * fsize prints the size of the specified file. Valid tags are the
       same as with the include command. The resulting format of this
       command is subject to the sizefmt parameter to the config command.

     * flastmod prints the last modification date of the specified file,
       subject to the formatting preference given by the timefmt
       parameter to config. Valid tags are the same as with the include
       command.

     * exec executes a given shell command or CGI script. It must be
       activated to be used. Valid tags are:

          + cmd will execute the given string using /bin/sh. All of the
            variables defined below are defined, and can be used in the
            command.

          + cgi will execute the given virtual path to a CGI script and
            include its output. The server does not perform error
            checking to make sure your script didn't output horrible
            things like a GIF, so be careful. It will, however, interpret
            any URL Location: header and translate it into an HTML
            anchor.


     _________________________________________________________________



Variables defined for Parsed Documents

   A number of variables are made available to parsed documents. In
   addition to the CGI variable set, the following variables are made
   available:

     * DOCUMENT_NAME: The current filename.

     * DOCUMENT_URI: The virtual path to this document (such as
       /~robm/foo.shtml).

     * QUERY_STRING_UNESCAPED: The unescaped version of any search query
       the client sent, with all shell-special characters escaped with \\.

     * DATE_LOCAL: The current date, local time zone. Subject to the
       timefmt parameter to the config command.

     * DATE_GMT: Same as DATE_LOCAL but in Greenwich mean time.

     * LAST_MODIFIED: The last modification date of the current document.
       Subject to timefmt like the others.


     _________________________________________________________________



   [IMAGE] Return to tutorial index
</message>
<message id="<1994May22.163825.24112@let.rug.nl>" date="2978613505">
Newsgroups: comp.text.sgml
Date: 22 May 1994 16:38:25 UT
From: Harry Gaylord \<galiard@let.rug.nl>
Organization: Faculteit der Letteren, Rijksuniversiteit Groningen, NL
Message-ID: <1994May22.163825.24112@let.rug.nl>
References: <1994May20.151629.1034@vaxb.phx1.aro.allied.com>
Subject: Re: Second Edition of Practical SGML

Look at the book by Yuri Rubinsky, The SGML Primer sold by Softquad.  In 36
pages complete with bibliography, he shows you dtds for Noah's flood and
settling the bill with friends at a steakhouse and much more.  It is great
fun to read and has all the basics in it.

--
Harry Gaylord
Rijksuniversiteit te Groningen
Groningen, The Netherlands
</message>
<message id="<9405192335.AA02123@pws09.aisg.com>" date="2978630559">
Newsgroups: comp.text.sgml
Date: 22 May 1994 21:22:39 UT
From: Mary Kroeger \<mkroeger@aisg.com>
Message-ID: <9405192335.AA02123@pws09.aisg.com>
Subject: IETMs

I am searching information on IETMs.  If you know where I can locate
information on this topic please send e-mail to mkroeger@aisg.com.

Thanks for your support.

--
Mary Kroeger
Document Analyst
Gateway Conversion Technologies, Inc.
</message>
<message id="<9405192346.AA02135@pws09.aisg.com>" date="2978630623">
Newsgroups: comp.text.sgml
Date: 22 May 1994 21:23:43 UT
From: Mary Kroeger \<mkroeger@aisg.com>
Message-ID: <9405192346.AA02135@pws09.aisg.com>
Subject: HyTime engine

O.K., I realize I may be asking a little much here, but does anyone know
where I could find a public domain or shareware HyTime engine?

This may just be wishful thinking, but any help in this area would be
greatly appreciated.

--
Mary Kroeger
Document Analyst
Gateway Conversion Technologies, Inc.
</message>
<message id="<2romt0INNlsp@oasys.dt.navy.mil>" date="2978635104">
Newsgroups: comp.text.sgml
Date: 22 May 1994 22:38:24 UT
From: Betty Harvey \<harvey@oasys.dt.navy.mil>
Organization: Carderock Division, NSWC, Bethesda, MD
Message-ID: <2romt0INNlsp@oasys.dt.navy.mil>
References: <9405192335.AA02123@pws09.aisg.com>
Subject: Re: IETMs

[Mary Kroeger]

|   I am searching information on IETMs.  If you know where I can locate
|   information on this topic please send e-mail to mkroeger@aisg.com.

The three Tri-Service Specifications: MIL-M-87268, General Content, Style,
Format and User-Interaction Requirements; MIL-D-87269, IETM Database
Requirements (this spec.  contains the Content Data Model (CDM), and
MIL-Q-87270, IETM Quality Assurance, can be FTP'd from navysgml.dt.navy.mil
(130.46.12.197) from /pub/ietm.  I have placed the three specifications in
both ASCII and Word Perfect formats.  The files are called:

gcsfui.asc  MIL-M-87268 - ASCII Format
gcsfui.wp   MIL-M-87268 - WordPerfect Format
ietmdb.asc  MIL-D-87269 - ASCII Format
ietmdb.wp   MIL-D-87269 - WordPerfect Format
ietmqa.asc  MIL-Q-87270 - ASCII Format
ietmqa.wp   MIL-Q-87270 - WordPerfect Format

If you can't get to navysgml.dt.navy.mil (we are still in the process of
getting this system setup) you can get the files from oasys.dt.navy.mil in
subdirectory /pub/ietm.  These files won't be on oasys.dt.navy.mil much
longer.  If you have trouble getting to navysgml.dt.navy.mil please drop me
an e-mail so that I can locate the problem.  Hope this helps.

			Betty

--
Betty Harvey  \<harvey@oasys.dt.navy.mil>     | David Taylor Model Basin
Advanced Information Systems Branch          | Carderock Division
Code 183                                     | Naval Surface Warfare
Bethesda, Md.  20084-5000                    |   Center
(301)227-3348   FAX (301)227-3343            | DTMB,CD,NSWC
</message>
<message id="<19940522.2347@naggum.no>" date="2978636901">
Newsgroups: comp.text.sgml
Date: 22 May 1994 23:08:21 UT
From: Erik Naggum \<erik@naggum.no>
Organization: Naggum Software; +47 2295 0313
Message-ID: <19940522.2347@naggum.no>
References: <9405192335.AA02123@pws09.aisg.com> <2romt0INNlsp@oasys.dt.navy.mil>
Subject: Re: IETMs

[Betty Harvey]

|   The three Tri-Service Specifications: MIL-M-87268, General Content,
|   Style, Format and User-Interaction Requirements; MIL-D-87269, IETM
|   Database Requirements (this spec.  contains the Content Data Model
|   (CDM), and MIL-Q-87270, IETM Quality Assurance, can be FTP'd from
|   navysgml.dt.navy.mil (130.46.12.197) from /pub/ietm.  I have placed the
|   three specifications in both ASCII and Word Perfect formats.  The files
|   are called:
|
|   gcsfui.asc  MIL-M-87268 - ASCII Format
|   gcsfui.wp   MIL-M-87268 - WordPerfect Format
|   ietmdb.asc  MIL-D-87269 - ASCII Format
|   ietmdb.wp   MIL-D-87269 - WordPerfect Format
|   ietmqa.asc  MIL-Q-87270 - ASCII Format
|   ietmqa.wp   MIL-Q-87270 - WordPerfect Format

your one-stop SGML FTP shop, ftp.ifi.uio.no:/pub/SGML, now sports an IETM
directory, with the above files (with the exception that .asc is .txt for
consistency).

best regards,
\</Erik>
--
Erik Naggum \<erik@naggum.no> \<SGML@ifi.uio.no>  |  memento, terrigena.
ISO 8879 SGML, ISO 10744 HyTime, ISO 10646 UCS  |  memento, vita brevis.
</message>
<message id="<Cq8MI6.6xp@news.Hawaii.Edu>" date="2978655341">
Newsgroups: comp.text.sgml
Date: 23 May 1994 04:15:41 UT
From: Torben Noerup Nielsen \<torben@foralie.net.Hawaii.Edu>
Organization: University of Hawaii
Message-ID: \<Cq8MI6.6xp@news.Hawaii.Edu>
Keywords: Superscript and subscript; mathematics
Subject: ISO 12083

Recently I sent out a query asking for information about the AAP/ISO 12083
DTD, CALS tables and ISO standards for mathematics.  Thanks to everyone who
responded; I got quite a few responses and I now have the ISO 12083
document from GCA.

However, I have run into a problem.  First, there are no examples of
mathematics in the ISO 12083 document and I have been unable to get a hold
of any.  The mathematics part of the DTD isn't even explained in detail in
the document.  Does anyone know of a source for a sample mathematics
document?  Preferably something I could use with DynaText.....

Also, in trying to convert a document from the DocBook DTD to ISO 12083, I
ran into a serious problem.  I cannot seem to find any facility for
generating subscripts and superscripts in ISO 12083.  I'm not sure if it's
just me who cannot read (possible!) or such things simply aren't there.

I need both subscripts and superscripts.  Mostly for mathematical formulas,
but also to represent phrases like ``19th" properly; i.e., I would like the
``th" part to be in a smaller font and raised a bit above the line.  Using
the DocBook DTD, I have been encoding this is ``19\<SUP>th\</SUP>".  Can
anyone help?

Thanks, Torben
</message>
<message id="<1994May23.125603.13447@dxcern.cern.ch>" date="2978686563">
Newsgroups: comp.text.sgml
Date: 23 May 1994 12:56:03 UT
From: Eric van Herwijnen \<evh@dxcern.cern.ch>
Organization: CERN European Laboratory for Particle Physics
Message-ID: <1994May23.125603.13447@dxcern.cern.ch>
References: \<Cq8MI6.6xp@news.Hawaii.Edu>
Keywords: Superscript and subscript; mathematics
Subject: Re: ISO 12083

The "annotated ISO12083", which contains the entire text of ISO 12083 in
electronic form, will be bundled with the paper version of ISO 12083.  It
is an electronic book in DynaText, and includes many examples of
mathematical formula's.  Mail me on 100043.3201@compuserve.com if you are
interested to order.

Eric van Herwijnen
</message>
<message id="<9405231308.AA16611@source.asset.com>" date="2978687291">
Newsgroups: comp.text.sgml
Date: 23 May 1994 13:08:11 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405231308.AA16611@source.asset.com>
References: \<Cq5Jy0.Mx@csc.ti.com> <2rlb64$pne@search01.news.aol.com>
Subject: Re: Embedding LaTeX in HTML

Eliot, has anyone ever documented the features of SGMLS beyond the bit of
readme data originally supplied?  It would seem that given the rapid growth
in usage of the parser and the other utilities that have grown up around
it, some kind of authoritative documentation, examples, etc. would be most
helpful.  Our programmers confess that the SGML source is abstract at best,
not impossible, but resource consuming to digest.  I understand that while
free, one takes what one can get, but perhaps in the second edition of your
book, and appendix containing your e-mail snippets and some words to tie it
all together would be appreciated.

Len
</message>
<message id="<2rqb34$149@search01.news.aol.com>" date="2978688548">
Newsgroups: comp.text.sgml
Date: 23 May 1994 13:29:08 UT
From: Eliot Kimber \<drmacro@aol.com>
Message-ID: <2rqb34$149@search01.news.aol.com>
References: <9405192346.AA02135@pws09.aisg.com>
Subject: Re: HyTime engine

[Mary Kroeger]

|   O.K., I realize I may be asking a little much here, but does anyone
|   know where I could find a public domain or shareware HyTime engine?

There's not one (unless you count the very crude Perl Script I just put on
the SGML FTP server).  TechnoTeacher's MarkMinder/HyMinder product, which
is the most complete HyTime application enabler yet available, is now quite
attractively priced, and considering the functionality it provides, a
bargain I think.  You can get more information by sending email to
HyMinder@techno.com.  The price list I have, dated Feb 71, 1994, shows
HyMinder licenses (which includes the MarkMinder component) starting at 3K
dollars US for a developer's license, with prices going down for greater
volumes.

The ObjectSGML project announced at SGML '93 will be a public domain HyTime
parser that will provide some HyTime functionality, but won't be what most
people would consider a complete "engine.  ObjectSGML is still under
development and I don't know what the expected availability date is.

--
\<Address HyTime=bibloc>
W. Eliot Kimber (kimber@passage.com) Systems Analyst and HyTime Consultant
Passage Systems, Inc., 9971 Quail Blvd., Suite 903, Austin TX 78758,+1 512 339 1400
465 Fairchild Dr., Suite 201, Mountain View, CA  94043, +1 415 390 0911
\</Address>
</message>
<message id="<9405231332.AA17749@wpdis03.wpafb.af.mil>" date="2978688725">
Newsgroups: comp.text.sgml
Date: 23 May 1994 13:32:05 UT
From: "Richard G. Littleton" \<rlittlet@wpdis03.wpafb.af.mil>
Message-ID: <9405231332.AA17749@wpdis03.wpafb.af.mil>
References: <9405192335.AA02123@pws09.aisg.com> <2romt0INNlsp@oasys.dt.navy.mil>
Subject: Re: IETMs

[Betty Harvey]

|   The three Tri-Service Specifications: MIL-M-87268, General Content,
|   Style, Format and User-Interaction Requirements; MIL-D-87269, IETM
|   Database Requirements (this spec. contains the Content Data Model
|   (CDM), and MIL-Q-87270, IETM Quality Assurance, can be FTP FTP'd from
|   navysgml.dt.navy.mil (130.46.12.197) from /pub/ietm.  I have placed the
|   three specifications in both ASCII and Word Perfect formats.  The files
|   are called:
|
|   gcsfui.asc  MIL-M-87268 - ASCII Format
|   gcsfui.wp   MIL-M-87268 - WordPerfect Format
|   ietmdb.asc  MIL-D-87269 - ASCII Format
|   ietmdb.wp   MIL-D-87269 - WordPerfect Format
|   ietmqa.asc  MIL-Q-87270 - ASCII Format
|   ietmqa.wp   MIL-Q-87270 - WordPerfect Format

Keep in mind that the hard copy from the Navy Publishing and Printing
Service Office is the only contractually binding version of these
documents.
</message>
<message id="<9405231805.AA62856@source.asset.com>" date="2978705153">
Newsgroups: comp.text.sgml
Date: 23 May 1994 18:05:53 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405231805.AA62856@source.asset.com>
Subject: ISO 10744

This question probably gets asked a few dozen times a month,
but, how and from where does one order copies of ISO 10744?
</message>
<message id="<19940523.2384@naggum.no>" date="2978715448">
Newsgroups: comp.text.sgml
Date: 23 May 1994 20:57:28 UT
From: Erik Naggum \<erik@naggum.no>
Organization: Naggum Software; +47 2295 0313
Message-ID: <19940523.2384@naggum.no>
References: <9405231805.AA62856@source.asset.com>
Subject: Re: ISO 10744

[Len Bullard]

|   This question probably gets asked a few dozen times a month,
|   but, how and from where does one order copies of ISO 10744?

officially, through ANSI.

    American National Standards Institute
    11 West 42nd St.
    New York, NY 10036
    USA
    PH: 212-642-4900
    FX: 212-302-1286

they wanted USD 132.00 for it in early April this year.

see also Heather Davenport's standards list:

    ftp.ifi.uio.no:/pub/SGML/standards

last updated 1994-04-07.

best regards,
\</Erik>
--
Erik Naggum \<erik@naggum.no> \<SGML@ifi.uio.no>  |  memento, terrigena.
ISO 8879 SGML, ISO 10744 HyTime, ISO 10646 UCS  |  memento, vita brevis.
</message>
<message id="<9405232107.AA81911@source.asset.com>" date="2978716056">
Newsgroups: comp.text.sgml
Date: 23 May 1994 21:07:36 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405232107.AA81911@source.asset.com>
Subject: HyTime and CGM

An interesting idea is being examined and I wonder if anyone else is
working to define the issues involved.

HyTime defines hypermedia addressing for SGML applications. The Computer
Graphics Metafile (CGM) group has proposed mechanisms for hyperlinks from
CGM files.  As yet, I am not aware of the details of how this is defined in
CGM.  The issue: how do these two definitions for hyperlinking coexist and
cooperate?  HyTime provides a notation-driven mechanism by which one passes
arguments to an application in a foreign notation?  How would CGM do this,
and what would this mean to the BOS?

Do we have two standards with overlap?  Do we have two mutually exclusive
definitions?  Do we have the usual mess that occurs when non-coordinated
efforts collide?

The last comment is simple frustration.  The reality is that intelligent
graphic objects are just as valid as intelligent text objects, but the
nightmare of testing and proving that these objects in concert meet the
requirements of an IETM may be tricky.  How does the acquisition and
contracts personnel write a requirement in which each application performs
some part of the task of the IETM without overlap and without confusing the
authors?

I propose that this discussion is part of the overall problem of specifying
behavior in hypermedia and should be examined in this forum.  The time when
we can consider the graphic as a simple notation that is processed and
returns control to the SGML application is closing as we speak.

Len Bullard
</message>
<message id="<19940523.2389@naggum.no>" date="2978720658">
Newsgroups: comp.text.sgml
Date: 23 May 1994 22:24:18 UT
From: Erik Naggum \<erik@naggum.no>
Organization: Naggum Software; +47 2295 0313
Message-ID: <19940523.2389@naggum.no>
References: <2rlb64$pne@search01.news.aol.com> \<Cq5zpt.4vn@csc.ti.com> \<DMEGGINS.94May22074722@aix1.uottawa.ca> \<Cq7Hzu.GAC@csc.ti.com>
Subject: Re: Embedding LaTeX in HTML

[Philip Thrift]

|   This is the way NCSA httpd 1.2 supports this. Before this version it
|   would have been something like \<inc srv "|/bin/date">. Since it is a
|   revision made by NCSA I would guess it is somewhat "official" and not
|   "local", but the reasoning for NCSA httpd moving to this syntax is a
|   mystery about which I remain curious.

where is the point of authority on HTML and httpd located these days?

it used to be with a relatively competent team at CERN, but the more I look
at all the nifty features that show up in various browsers, the increasing
requirements to use specific browsers to obtain these features, and the
commercial license agreements on the browser's environments, I become a
little concerned that we may once again have bought nifty features at the
expense of our information.

SGML provides a well-defined mechanism to identify processing-specific
information, if required.  why do programmers then invent stupid hacks to
solve the same problem?  are they just unaware of the standard (if they
example you gave above is actually what they did, it's an SGML syntax
violation, too), don't they care, or are they actively trying to corner the
market by requiring others to buy their product, which they will "upgrade"
at various intervals to invalidate millions of networked documents?

don't get me wrong -- I think WWW and HTML are great.  only... I wish it
was more of an information-driven thing than a product-driven thing.  I
hate GUI (gooey) programs, and using the otherwise excellent lynx program
to access the WWW is now largely hopeless.  firing up Mosaic requires
Motif, which _really_ sucks, not to mention eats bandwidth by the megabit
even on fast X terminals, and I'm running on my home machine because Motif
is licenseware, and I don't want to pay a company to get free access to
free information.

I don't think programmers will stop doing what comes naturally -- make
things that are cool and fun to use -- but their users must be on guard to
halt the "progress" into oblivion for their information.  the time will
come (it may have come, already) when it will not be a viable option to
"upgrade" all the HTML files along with every release of NCSA Mosaic or
other tools.  the permanent solution is of course to have all the data in
real SGML, and regard HTML as an output format.

then again, what if HTML used architectural forms instead of whole
elements, so we could add HTML to real SGML documents?  yeah, I know, I'm
dreaming.

best regards,
\</Erik>
--
Erik Naggum \<erik@naggum.no> \<SGML@ifi.uio.no>  |  memento, terrigena.
ISO 8879 SGML, ISO 10744 HyTime, ISO 10646 UCS  |  memento, vita brevis.
</message>
<message id="<JOLLIE.94May23202222@grant.cs.uiowa.edu>" date="2978731341">
Newsgroups: comp.text.sgml
Date: 24 May 1994 01:22:21 UT
From: "Jeffrey C. Ollie" \<jollie@grant.cs.uiowa.edu>
Organization: The University of Iowa, Iowa City, Iowa, USA
Message-ID: \<JOLLIE.94May23202222@grant.cs.uiowa.edu>
Subject: HTML/SGML/Inclusions, etc.

Having some experience as a WWW information provider, I'd like to make some
comments about some of the topics that have come up lately.

1.  Server-side processing of HTML documents.  The most recent versions of
    NCSA Mosaic embed server side processing directives in SGML comments.
    There are a number of reasons for this (which were outlined in the post
    of the NCSA server documentation, but I'd like to condense/comment).

    A.  The server should *not* be a SGML compliant parser!  Yes, SGML has
	some well-defined methods of processing specific information, but
	imagine the load on the server if it had to completely parse the
	document!

    B.  Embedding processing directives for the server in SGML comments
	makes sense since the information is easily located (everything
	between "\<!--#" and "-->") and it is completely ignored by browsers
	if the server does not process it for some reason.

    C.  The notation "\<inc srv "...">" used in pre-1.2 versions of the NCSA
	WWW server was unfortunate in that it encouraged people to think
	that this was SGML.

    D.  The server ought to be seen in a light similar to the C
	pre-processor.  It performs some magic outside the scope of the
	language before presenting it to the language processor.

    E.  WWW servers serve more than HTML.  They serve images, text, movies,
	sound, and act as gateways between databases.  Perhaps those of us
	that are concerned about the non-SGMLness of current browsers
	should put our coding skills where our mouth is and develop a
	browser that is SGML compliant.

2.  Inclusions of LaTeX, etc.  The method for doing this should depend on
    what does the processing.  If the processing is done on the server, a
    non-SGML method is probably best (see above).  If the processing is
    done by the client, SGML methods ought to be used to encode this, since
    the client is supposed to be (at least in theory) a SGML parser with a
    fixed DTD.

3.  The non-SGMLness of HTML and the subsets/supersets implemented by the
    various browsers.  This is indeed unfortunate.  The problems began
    early on when HTML was described and implemented by people who
    apparently understood only enough of SGML to get themselves into
    trouble.  This problem has only been compounded by the millions of
    people who are attracted by the great power and utility of a system
    like the Web, but know little or nothing about SGML.  The fact that all
    of the available browsers are extremely lenient has not helped matters.

    HTML+ is a step in the direction of a more SGML compliant Web, but
    browsers (especially Mosaic) are behind in implementing all of its
    features.  However, problems will persist while the knowledge of SGML
    continues to lack.

I apologize if this sounds like somewhat of a diatribe, but I felt it
necessary to make some of these points.  Asbestos underwear on!

--
Jeffrey C. Ollie
jeffrey-ollie@uiowa.edu
http://caesar.cs.uiowa.edu/~jollie/
</message>
<message id="<2rrp98INNacl@oasys.dt.navy.mil>" date="2978735848">
Newsgroups: comp.text.sgml
Date: 24 May 1994 02:37:28 UT
From: Betty Harvey \<harvey@oasys.dt.navy.mil>
Organization: Carderock Division, NSWC, Bethesda, MD
Message-ID: <2rrp98INNacl@oasys.dt.navy.mil>
References: <2rlb64$pne@search01.news.aol.com> <19940523.2389@naggum.no>
Subject: Re: Embedding LaTeX in HTML

[Erik Naggum]

|   don't get me wrong -- I think WWW and HTML are great.  only... I wish
|   it was more of an information-driven thing than a product-driven thing.
|   I hate GUI (gooey) programs, and using the otherwise excellent lynx
|   program to access the WWW is now largely hopeless.  firing up Mosaic
|   requires Motif, which _really_ sucks, not to mention eats bandwidth by
|   the megabit even on fast X terminals, and I'm running on my home
|   machine because Motif is licenseware, and I don't want to pay a company
|   to get free access to free information.

Like SGML tools, Word Processors and operating systems, there just isn't a
GUI to end all GUI's.  My best operating system would be VMS with the
navigational abilities of UNIX.  The best Word Processor is (supply your
own).

|   then again, what if HTML used architectural forms instead of whole
|   elements, so we could add HTML to real SGML documents?  yeah, I know,
|   I'm dreaming.

In the beginning of WWW and HTML I thought that it could be the best thing
that could happen to SGML.  It might not have started out being the best
SGML but it was an application of SGML that worked.  It still believe that
it is good for SGML.  All of a sudden SGML is being talked about.  SGML
experts are getting involved in the HTML effort which will not only help
HTML but the SGML community at large.

We need to realize that there are several different types of data.  Two
types of data are that we want to keep for all eternity and throw away data
that has a short life expectancy.  The current HTML is perfectly suited to
the short term information.  It is easy to author, tools are readily
available to help in the authoring or conversion process and best of all
its cheap.

However, on the other end of the spectrum if we want to author data using
SGML and present this data either electronically or in paper form, it is
not easy, the tools are difficult to use, legacy data doesn't fit easily
and it is almost cost restrictive.

Most of the Internet community never heard of SGML until they became aware
of the WWW.  Haven't you noticed the change in comp.text.sgml.  9 months
ago when reading about the WWW and HTML, SGML was never mentioned.  People
in the general Internet community weren't aware that SGML was used as the
principle for HTML.  HTML was always mentioned as a separate entity.  As
the community is becoming educated SGML is mentioned as the backbone for
HTML.

I think the SGML community can help mold and facilitate the HTML DTD to
become a truly robust, useful and successful (it is already successful in
my book) SGML application.

From my point of view I haven't seen very many successful SGML publication
applications.  We seem to be having better successes in electronic
applications of SGML than in the traditional paper publication
environments.

--
Betty Harvey  \<harvey@oasys.dt.navy.mil>     | David Taylor Model Basin
Advanced Information Systems Branch          | Carderock Division
Code 183                                     | Naval Surface Warfare
Bethesda, Md.  20084-5000                    |   Center
(301)227-3348   FAX (301)227-3343            | DTMB,CD,NSWC
</message>
<message id="<19940524.2396@naggum.no>" date="2978736722">
Newsgroups: comp.text.sgml
Date: 24 May 1994 02:52:02 UT
From: Erik Naggum \<erik@naggum.no>
Organization: Naggum Software; +47 2295 0313
Message-ID: <19940524.2396@naggum.no>
References: \<JOLLIE.94May23202222@grant.cs.uiowa.edu>
Subject: Re: HTML/SGML/Inclusions, etc.

[Jeffrey C. Ollie]

|   A.  The server should *not* be a SGML compliant parser!  Yes, SGML has
|	some well-defined methods of processing specific information, but
|	imagine the load on the server if it had to completely parse the
|	document!

well, I can not only imagine this, I speak of experience.  parsing SGML is
fast and painless.  _validating_ SGML is slow and painful.  ISO 8879 does
_not_ require you to validate a document.  validation is explicitly left as
an option.

now, how about some features?  omitted tag minimization is a little
expensive.  short tag minimization is a little more expensive.  short
references are inexpensive.  entity references are inexpensive.

what drags a system down is trying to be smart about error recovery, which
is horribly expensive in SGML.  if you don't recover from errors, you can
skip all the work in this department and breeze through a document.

now, think about it: if you need error recovery in a server that reads
static documents, error recovery is not your biggest problem.  so, validate
the document up front, relieve the parser of this job, and you will be able
to process SGML without any of the mythical load attributed to SGML parsing.

|   B.  Embedding processing directives for the server in SGML comments
|	makes sense since the information is easily located (everything
|	between "\<!--#" and "-->") and it is completely ignored by browsers
|	if the server does not process it for some reason.

ditto for processing instructions.

|   E.  WWW servers serve more than HTML.  They serve images, text, movies,
|	sound, and act as gateways between databases.  Perhaps those of us
|	that are concerned about the non-SGMLness of current browsers
|	should put our coding skills where our mouth is and develop a
|	browser that is SGML compliant.

maybe we already have.  besides it's not the browser as a whole that needs
to be SGML compliant, only its input language processor.  what you do with
the information afterwards is completely up to you.  thus, with a nice,
clean layer between SGML parser and application software, you don't have
any of the problems embedded parsing poses today.

best regards,
\</Erik>
--
Erik Naggum \<erik@naggum.no> \<SGML@ifi.uio.no>  |  memento, terrigena.
ISO 8879 SGML, ISO 10744 HyTime, ISO 10646 UCS  |  memento, vita brevis.
</message>
<message id="<16FC0A657S85.UPP201@ibm.rhrz.uni-bonn.de>" date="2978761783">
Newsgroups: comp.text.sgml
Date: 24 May 1994 09:49:43 UT
From: Michael Burschik \<UPP201@ibm.rhrz.uni-bonn.de>
Organization: RHRZ Uni-Bonn
Message-ID: <16FC0A657S85.UPP201@ibm.rhrz.uni-bonn.de>
Subject: Problem with article.dtd and psgml

I am using psgml to edit SGML files, but I have this problem: having
successfully parsed the ISO article DTD, psgml won't let me insert \<emph>
or any other sub-paragraph element.  SGMLS seems to back psgml up on this
point.  What am I doing wrong?  Or did I simply misinterpret the DTD?

Cheers,
        Mike.
</message>
<message id="<9405241301.AA41169@source.asset.com>" date="2978773318">
Newsgroups: comp.text.sgml
Date: 24 May 1994 13:01:58 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405241301.AA41169@source.asset.com>
References: <2rlb64$pne@search01.news.aol.com> \<Cq5zpt.4vn@csc.ti.com> \<DMEGGINS.94May22074722@aix1.uottawa.ca> \<Cq7Hzu.GAC@csc.ti.com> <19940523.2389@naggum.no>
Subject: Re: Embedding LaTeX in HTML

Dream away, Erik.  Only the dreamers speak the words that otherwise are
lost on the tides of circumstance and indecision.  How many of us that
straddle the worlds of design and standardization find that the committees
want to set the agenda without understanding the issues, or have taken a
standard that took thousands of hours worth of effort to the programmers
only to find that same paper stuffed in our garbage can a few hours later
because the coder couldn't find his latest MFC objects mentioned in it.

This is one of the oldest struggles of our kind.  Socrates admonished
Alcibiades whom he loved not to always turn to self-interest, but
Alcibiades still chose to betray his city to the enemy of his love.  Ayn
Rand wrote thousands of pages to demonstrate the case that selfishness is a
virtue.  And so it goes...  But do not cease to dream or speak those dreams
to others.  In those dreams is all your freedom, and in those words, all
our hopes to be free.

Cheers,

Len
</message>
<message id="<9405241344.AA41168@source.asset.com>" date="2978775888">
Newsgroups: comp.text.sgml
Date: 24 May 1994 13:44:48 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405241344.AA41168@source.asset.com>
References: \<JOLLIE.94May23202222@grant.cs.uiowa.edu> <19940524.2396@naggum.no>
Subject: Re: HTML/SGML/Inclusions, etc.

The discussion of embedding processing directives sounds very much like the
issues we are debating for embedding behavior (e.g., dialogs) into IETM
instances.  Why haven't xenoforms or marked functions been suggested?  I'm
told that Dr. Goldfarb and Yuri Rubinsky have made some suggestions in the
direction of using event schedules in HTML.  I don't have any of the
details so that may be in error.  In any event (pardon the pun), it seems
that if HTML has a need to be an ISO-conforming application, and is a
hypermedia application, it should use the ISO standard 10744 as a basis for
its features.  If for technical reasons this is not possible, these reasons
should be delineated in this forum so that those of us who are embarking on
building HyTime applications can become more aware of the pitfalls, if any.
If this is not the case, then the designers and implementors or HTML should
simply state that it is their intent to create a free but proprietary
application and that users who choose to put their data in the HTML formats
must be aware of the consequences, if any.

I concur with Betty Harvey that applications such as HTML can advance the
cause of SGML, but I don't think that was a reason for creating it.  On the
other hand, applications of SGML are just applications and only in their
services will they find acceptance.  If one of those services is to provide
fully-compliant ISO support, then the standards are important.  If they are
only intended to be used as browsers without regard to long term data
preservation and reuse, then standard is useful insomuch as it supports the
software.

A system implementor can choose the formats for the data based on:

o Probability of reuse - how many times in a period is data accessed, how
  many and what types of links are made to the data, what are the functions
  associated with the links.

o Frequency of update - what is the rate of change of the data and in what
  scales

Actual reuse is determined by the number of documents that use a node as
part of its content.  Sometimes this reuse is by reference, and at other
times it is by value.  The second option has a higher maintenance cost but
may have a lower access cost.  Reuse and update couple in the transaction
cost.  From these measures, the implementor can partially predict the cost
of the transactions.  Transaction cost cannot be predicted in isolation
from the process which requires the system: the mission.  To put this as
succinctly as my limited math skill allows:

1.  For a system (S) there exists N subsystems (S subscript n) whose
    Readiness level (R sub l) is maintained by K topics (T sub k) where the
    relationships among S sub n and T sub K are expressed as C classes of
    Links (L sub C).

2.  For S sub n, a set of observers (O sub j) exists whose relationships to
    S sub N T sub K are expressed as a subset of L sub C during Event (E
    sub st) where st is the space/time extent when Input conditions I sub v
    are in effect (v is value).

3.  For S, the qualified Mission capability (M) equals the total number of
    S sub N T sub K squared minus the sum of S sub n T sub k where T sub k
    has negative values ( i.e., do not exist or cannot be addressed).

Dynamism in the hypermedia is achieved by behavioral branching under the
control of conditions whose values are input by users of the system within
the constraints and opportunities offered by a particular environment.

For the moment, we can avoid the non-linear dynamic aspects of event-driven
environments.  What we cannot avoid are the potential chaotic aspects of
system which do not carefully encode the address space and the processes
which create and modify this space.  Only the standard serves as a strong
coupler of mission and artifice.  It is in this respect that we plead for
the implementors of HTML to use the standards to the largest degree that is
practical.

Len Bullard
</message>
<message id="<CqB87D.GKt@csc.ti.com>" date="2978776776">
Newsgroups: comp.text.sgml
Date: 24 May 1994 13:59:36 UT
From: Philip Thrift \<thrift@ra.csc.ti.com>
Organization: Texas Instruments
Message-ID: \<CqB87D.GKt@csc.ti.com>
References: \<Cq7Hzu.GAC@csc.ti.com> <19940523.2389@naggum.no> <9405241301.AA41169@source.asset.com>
Subject: Re: Embedding LaTeX in HTML

Thanks for all the feedback on this issue.  I started this describing my
current system called 'htmx2html' that processes HTML files with embedded
LaTeX like:

   \<!--#tex
       \\[ e^{i\\pi} + 1 = 0  \\]
     -->

whereas it would be better to use processing instructions like:

   \<?#tex
       \\[ e^{i\\pi} + 1 = 0  \\]
    >

(There was no feedback on comp.infosystems.www, BTW).

-pt
</message>
<message id="<9405241524.AA63728@source.asset.com>" date="2978781858">
Newsgroups: comp.text.sgml
Date: 24 May 1994 15:24:18 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405241524.AA63728@source.asset.com>
References: \<Cq7Hzu.GAC@csc.ti.com> <19940523.2389@naggum.no> <9405241301.AA41169@source.asset.com> \<CqB87D.GKt@csc.ti.com>
Subject: Re: Embedding LaTeX in HTML

Why not try:

\<xenoproc id="xp1" notation=latex>
\\[ e^{i\\pi} + 1 = 0  \\
\</xenoproc>

then the parser could do a bit of checking before processing it and any
system dependencies could be isolated.  If the xenoform GI indicated the
purpose of the process, then one could also have a reconciliation strategy
when another system did not process LaTex.  Would this approach also work?

Len Bullard
</message>
<message id="<2rtdbq$hjj@nrcnet0.nrc.ca>" date="2978789178">
Newsgroups: comp.text.sgml
Date: 24 May 1994 17:26:18 UT
From: Russ Thomas \<thomas@irc.lan.nrc.ca>
Organization: National Research Council Canada
Message-ID: <2rtdbq$hjj@nrcnet0.nrc.ca>
References: <9405231805.AA62856@source.asset.com>
Subject: Re: ISO 10744

[Len Bullard]

|   This question probably gets asked a few dozen times a month,
|   but, how and from where does one order copies of ISO 10744?

[Erik Naggum]

|   officially, through ANSI.
|
|       American National Standards Institute
|       11 West 42nd St.
|       New York, NY 10036
|       USA
|       PH: 212-642-4900
|       FX: 212-302-1286
|
|   they wanted USD 132.00 for it in early April this year.

You can also obtain them directly from ISO in Geneve as well as from your
National Standards Body.  You can email directly to ISO Sales for further
information at:

    sales@isocs.iso.ch

Regards,

Russ.

--
Dr J Russell Thomas,                         Phone: (613) 993 - 0817
National Research Council of Canada,           Fax: (613) 952 - 7673
I.R.C. - ILB,                              email:thomas@irc.lan.nrc.ca
Building M-20,                                  : russ@erie.irc.nrc.ca
Montreal Road, Ottawa, Ontario K1A 0R6
Canada.
</message>
<message id="<CqBJ8u.B6s@unx.sas.com>" date="2978791086">
Newsgroups: comp.text.sgml,comp.text,comp.lang.misc,comp.lang.icon
Date: 24 May 1994 17:58:06 UT
From: Gary Merrill \<sasghm@theseus.unx.sas.com>
Organization: SAS Institute Inc.
Message-ID: \<CqBJ8u.B6s@unx.sas.com>
References: <1994May17.140242.27932@tin.monsanto.com> \<CpyDI0.M9M@world.std.com>
Subject: Re: Spitbol reference?

[Joel Finkle]

|   I'm looking for reference material on the Spitbol language.

[John Smith]

|   The vendor for Spitbol is:
|
|   	Catspaw Inc.
|   	PO Box 1123
|   	Salida, CO 81201 USA
|   	+1 719 539 3884
|
|   cats!support@cs.arizona.edu

I recall seeing the Catspaw SPITBOL environment some time ago, and though I
never used it, it looked quite nice.

|   --
|   \<!--
|     "To imagine a language is to imagine a form
|      of life."                   - Wittgenstein,
|                    Philosophical Investigations

Yes, and the form of life of which SNOBOL is a model is likely to be quite
foreign to you.  It would be good to have a "pet" SNOBOL (or SPITBOL)
interpreter to help you get up to speed.

As far as references for the *langauge*, the following are gathering dust
on my shelves:

    _The SNOBOL4 Programming Language, 2nd Ed., by Griswold, Poage, and
    Polonsky.  Prentice Hall.

This is the classic language reference text.

    _The Programmer's Introduction to SNOBOL_ by Maurer.  Elsevier.

This is part of a series in which each book seeks to introduce someone
already skilled in programming to a new language.

    _String and List Processing in Snobol4: Techniques and Applications_,
    by Griswold.  Prentice-Hall.

Probably my favorite text on the language, this deals with somewhat more
advanced topics and algorithms.

Isn't it great that SGML is providing such a rich environment for the
application of what many of us might have thought of as dead languages and
elderly sets of text processing tools?  A programmer's paradise.  I hope
you enjoy your use of SPITBOL.  It is one of my favorite languages, though
one hardly sees it used much any more at all.  Perhaps this type of
application to SGML environments will stimulate a revival.

--
Gary H. Merrill  [Principal Systems Developer, Compiler and Tools Division]
SAS Institute Inc. / SAS Campus Dr. / Cary, NC  27513 / (919) 677-8000
sasghm@theseus.unx.sas.com ... !mcnc!sas!sasghm
</message>
<message id="<2rth02$eid@darwin.nbnet.nb.ca>" date="2978792898">
Newsgroups: comp.text.sgml
Date: 24 May 1994 18:28:18 UT
From: Mike Hodgson \<support@nbnet.nb.ca>
Organization: ACT Support Services
Message-ID: <2rth02$eid@darwin.nbnet.nb.ca>
Subject: Training & SGML: Request for Info.

Has anyone in this newsgroup ever used SGML for creating computer based
training applications?  Specificially has anyone used SGML with an
authoring system, simulators, testbank, etc?  The authoring systems that we
use use displays with vector and bit mapped graphics and we wantd to find
out if it was possible to find an authoring system that uses these and SGML
at the same time.

Also, has anyone ever used SGML for access to a relational database system
or an AI (expert system) application?  If you have, where did you get it
and how much did it cost?

I would appreciate any comments.

TIA.
--
Mike Hodgson, ACT Support Services.
support@nbnet.nb.ca
</message>
<message id="<JOLLIE.94May24155725@grant.cs.uiowa.edu>" date="2978801845">
Newsgroups: comp.text.sgml
Date: 24 May 1994 20:57:25 UT
From: "Jeffrey C. Ollie" \<jollie@grant.cs.uiowa.edu>
Organization: The University of Iowa, Iowa City, Iowa, USA
Message-ID: \<JOLLIE.94May24155725@grant.cs.uiowa.edu>
References: \<JOLLIE.94May23202222@grant.cs.uiowa.edu> <31326.smithn@orvb.saic.com>
Subject: Re: HTML/SGML/Inclusions, etc.

[Norman E Smith]

|   As someone also with experience as an "WWW information provider", I
|   have to disagree with many of the comments that Jeffrey Ollie makes
|   about the server "parsing" documents.

It's a free world...

|   I have set up the server for a customer as an "SGML" system.  Our
|   documents are parsed before being placed on the Web Server and
|   hyperlinks resolved via SGML entities.  There is no need for the server
|   to parse these documents in an SGML sense again when they are accessed.
|   So, there is no additional load on the server for processing all of our
|   HTML documents as proper SGML.
|   
|   The server has to parse the tags whether the information is represented
|   in proper SGML form or whether embedded inside comments.

Why does the server need to parse the tags?  All it needs to do is look for
the special strings "\<!--#" and "-->" (on the NCSA version) and then
interpret the information between, which is not SGML.

|   I agree with Erik, I don't like this approach from an SGML point of
|   view.  As far as I can tell, this approach is similar to the one that
|   Adobe took with PostScript. The language is defined in concrete, well
|   ROM, and must have the same operation regardless of the printer
|   brand. Vendor specific features have tended to be added as special
|   comments...

You miss my point.  The server should understand nothing of SGML.  What if
I wanted the server to preprocess some ASCII text (adding a time stamp,
etc.).  The only justification for using "\<!--#" and "-->" to delimit
server processing directives is to ensure that it has no meaning at all to
the HTML browser.

Maybe someone will come up with a server that runs everything through the
m4 preprocessor.  Server-side directives depend on the server, so they
should not be part of HTML!

|   There is no connection between having HTML be proper SGML and the fact
|   that servers contains multiple types of data. SGML has the mechanisms
|   to reference external data and HTML uses it (\<img src="">).

I never said that HTML shouldn't be proper SGML.  I said that server
processing directives shouldn't be a part of HTML.  The mechanism that you
use as an example is processed by the client only.

|   Again, neither the client nor the server have to be SGML parsers!  They
|   only have to parse specific tags from the input stream, which is
|   considerably simpler!  A Web server is essentially read-only from the
|   client's prospective.  If a document is SGML parsed before it is loaded
|   onto the server, then it is valid and does not have to be SGML parsed
|   every time a client retrieves it.  Nor does the client have to SGML
|   parse the document.  The current state of HTML documents makes Client
|   SGML parsing of documents before rendering an almost impossible
|   proposition.

Yes, the client does have to be a SGML parser (if it intends to display
anything but the raw HTML).  As Eric pointed out, the client does not need
to be an SGML validator, which he says is a completely different beast (and
I tend to agree).

--
Jeffrey C. Ollie
jeffrey-ollie@uiowa.edu
http://caesar.cs.uiowa.edu/~jollie/
</message>
<message id="<1994May24.212757.11431@midway.uchicago.edu>" date="2978803677">
Newsgroups: comp.text.sgml,comp.text,comp.lang.misc,comp.lang.icon
Date: 24 May 1994 21:27:57 UT
From: "Richard L. Goerwitz" \<goer@midway.uchicago.edu>
Organization: University of Chicago
Message-ID: <1994May24.212757.11431@midway.uchicago.edu>
References: <1994May17.140242.27932@tin.monsanto.com> \<CpyDI0.M9M@world.std.com> \<CqBJ8u.B6s@unx.sas.com>
Subject: Re: Spitbol reference?

[Gary Merrill]

|       _The SNOBOL4 Programming Language, 2nd Ed., by Griswold, Poage, and
|       Polonsky.  Prentice Hall.
|   
|   This is the classic language reference text.

It's also out of print!

--
   -Richard L. Goerwitz              goer%midway@uchicago.bitnet
   goer@midway.uchicago.edu          rutgers!oddjob!ellis!goer
</message>
<message id="<1994May24.213549.12011@midway.uchicago.edu>" date="2978804149">
Newsgroups: comp.text.sgml
Date: 24 May 1994 21:35:49 UT
From: "Richard L. Goerwitz" \<goer@midway.uchicago.edu>
Organization: University of Chicago
Message-ID: <1994May24.213549.12011@midway.uchicago.edu>
Subject: SGML editors

I've stayed on the outside of the SGML community, waiting to see what goes
down in the way of practical software.

Quick question: Is there an SGML editor that allows practical word
processing in an office and/or scholarly environment, especially a
multi-lingual one?

--
   -Richard L. Goerwitz              goer%midway@uchicago.bitnet
   goer@midway.uchicago.edu          rutgers!oddjob!ellis!goer
</message>
<message id="<CqBz78.Dst@world.std.com>" date="2978811764">
Newsgroups: comp.text.sgml,comp.text,comp.lang.misc,comp.lang.icon
Date: 24 May 1994 23:42:44 UT
From: John Smith \<sgmlsh@world.std.com>
Organization: The World Public Access UNIX, Brookline, MA
Message-ID: \<CqBz78.Dst@world.std.com>
References: <1994May17.140242.27932@tin.monsanto.com> \<CpyDI0.M9M@world.std.com> \<CqBJ8u.B6s@unx.sas.com> <1994May24.212757.11431@midway.uchicago.edu>
Subject: Re: Spitbol reference?

[Gary Merrill]

|       _The SNOBOL4 Programming Language, 2nd Ed., by Griswold, Poage, and
|       Polonsky.  Prentice Hall.
|
|   This is the classic language reference text.

[Richard L. Goerwitz]

|   It's also out of print!

You can, however, order it from Catspaw.

Other sources are Susan Hockey, Programming for the Humanities, Clarendon
Press, 1985, and Christopher Butler, Computers in Linguistics, Basil
Blackwell, 1985.

I like Snobol -- once I got past the idea of the goto (U rather like them
now :-0....) -- because it is fast, rock solid, great for bashing text and
above all because it is cheap.  The Catspaw Spitbol that I use is also
well-documented and has a nice online help system.

--
\<!--
  "To imagine a language is to imagine a form
   of life."                   - Wittgenstein,
                 Philosophical Investigations
</message>
<message id="<kimber.1.0015CDB6@passage.com>" date="2978822884">
Newsgroups: comp.text.sgml
Date: 25 May 1994 02:48:04 UT
From: "W. Eliot Kimber" \<kimber@passage.com>
Organization: Passage Systems, Inc.
Message-ID: \<kimber.1.0015CDB6@passage.com>
References: \<JOLLIE.94May23202222@grant.cs.uiowa.edu> <31326.smithn@orvb.saic.com> \<JOLLIE.94May24155725@grant.cs.uiowa.edu>
Subject: Re: HTML/SGML/Inclusions, etc

[Jeffrey C. Ollie] \<JOLLIE.94May24155725@grant.cs.uiowa.edu>

|   Why does the server need to parse the tags?  All it needs to do is look
|   for the special strings "\<!--#" and "-->" (on the NCSA version) and
|   then interpret the information between, which is not SGML.

[Jeffrey C. Ollie] \<JOLLIE.94May23202222@grant.cs.uiowa.edu>

|   B.  Embedding processing directives for the server in SGML comments
|       makes sense since the information is easily located (everything
|       between "\<!--#" and "-->") and it is completely ignored by browsers
|       if the server does not process it for some reason.

[Norm E. Smith] <31326.smithn@orvb.saic.com>

|   I agree with Erik, I don't like this approach from an SGML point of
|   view.  As far as I can tell, this approach is similar to the one that
|   Adobe took with PostScript. The language is defined in concrete, well
|   ROM, and must have the same operation regardless of the printer
|   brand. Vendor specific features have tended to be added as special
|   comments...

[Jeffrey C. Ollie] \<JOLLIE.94May24155725@grant.cs.uiowa.edu>

|   You miss my point.  The server should understand nothing of SGML.  What
|   if I wanted the server to preprocess some ASCII text (adding a time
|   stamp, etc.).  The only justification for using "\<!--#" and "-->" to
|   delimit server processing directives is to ensure that it has no
|   meaning at all to the HTML browser.

I think the key point in this discussion has been missed, which is that
either HTML documents are conforming SGML documents or not.  If they are
conforming SGML documents, then all processing associated with them should
be done using correct SGML mechanisms.  IT HAS NOTHING TO DO WITH WHAT THE
SERVER SHOULD OR SHOULDN'T NEED TO DO.  It only has to do with what the
defined requirements and expectations for the data itself are.

The whole point of SGML is that your documents become independent, to the
greatest degree possible, of any given processor or application.  The HTML
discussion seems to have assumed that HTML documents will only be used with
HTTP servers or, even worse, only with a specific HTML browser.  This is
not the case.  I can should be able to process or browse HTML documents
with a wide variety of SGML-based systems, including Author/Editor,
DynaText, SGML Explorer, ADEPT Publisher, SGMLS, OmniMark, etc.  If one
particular processor, HTTP servers, define a non-standard way of doing a
standard thing (including fragments of documents), especially if they do it
by putting the data in comments, then they have prevented the equivalent
use of that data by the other tools.  Of the tools I listed, only OmniMark,
as far as I know, even gives me the opportunity of implementing this
comment hack, and I wouldn't do it any case, because I shouldn't have to.

The argument that "\<!--#" is easy to scan for is specious because "\<?# >"
is no more difficult to scan for and is equally non-meaningful.

If HTTP servers want to define a non-standard encoding to support some
feature, there's no way to prevent it, obviously, but the designers of
those systems must understand the disservice they are doing to their users
by forcing them to make their data non-portable or, possibly, even invalid
SGML.  This is absolutely the worst thing you can do, because it completely
erases the primary benefit of SGML in the first place, which is data
independence and integrity.

I find the WWW very exciting.  The fact that I can create a document, throw
it onto the web blindly, and know that it can be accessed and integrated
into a larger hypertext is fantastic.  It is for this reason that I feel so
strongly about the Web's use of SGML being as correct as possible.  If the
Web were not this important, I wouldn't care, but the Web is too important
not to do it correctly.  The Web has proven the basic premise of SGML, that
by using a standard, application-independent data representation, you can
enable blind interchange among disparate and even unknown or unpredicted
applications.  Thus it hurts all the more when small details threaten to
break the realization of this very promise.

--
\<Address HyTime=bibloc>
W. Eliot Kimber (kimber@passage.com) Systems Analyst and HyTime Consultant
Passage Systems, Inc., 9971 Quail Blvd., Suite 903, Austin TX 78758 (512)339-1400
465 Fairchild Dr., Suite 201, Mountain View, CA  94043, (415) 390-0911
\</Address>
</message>
<message id="<kimber.3.0016AE17@passage.com>" date="2978826039">
Newsgroups: comp.text.sgml
Date: 25 May 1994 03:40:39 UT
From: "W. Eliot Kimber" \<kimber@passage.com>
Organization: Passage Systems, Inc.
Message-ID: \<kimber.3.0016AE17@passage.com>
References: <9405232107.AA81911@source.asset.com>
Subject: Re: HyTime and CGM

[Len Bullard]

|   An interesting idea is being examined and I wonder if anyone else is
|   working to define the issues involved.

|   HyTime defines hypermedia addressing for SGML applications. The
|   Computer Graphics Metafile (CGM) group has proposed mechanisms for
|   hyperlinks from CGM files.  As yet, I am not aware of the details of
|   how this is defined in CGM.  The issue: how do these two definitions
|   for hyperlinking coexist and cooperate?  HyTime provides a
|   notation-driven mechanism by which one passes arguments to an
|   application in a foreign notation?  How would CGM do this, and what
|   would this mean to the BOS?

The issue is not *hyperlinking*, per se, since that is a semantic issue,
but *addressing*, on which hyperlinking depends, but for which hyperlinking
is not the only application (in other words, many mundane connections are
not hyperlinks in the sense of representing some relationship semantic
stronger than "is a property of" or "is a relative of").

I'm not familiar with the CGM proposal, but its ability to interoperate
with HyTime applications depends on the addressing mechanism it uses to
address from CGM objects to locations outside the graphic.  Note that there
is no inherent problem in addressing *into* CGM graphics as long the CGM
standard defines the mechanism by which portions of CGM graphics can be
addressed and/or provides drivers for CGM that translate abstract HyTime
addresses into locations within CGM graphics.

In HyTime, all addresses start as ID or entity references.  In other words,
the primary addresses defined by SGML, element IDs and entity names, are
used by HyTime as its basic addressing mechanism as well.  Addresses of
objects that do not have IDs are created by using address location elements
to map IDs to the actual location of the data, say by using a tree
location, a data location, or a query.  Except for notation locations,
which are a special case, HyTime uses abstract location mechanisms that
abstract all data as trees, node lists, token streams, or quantum arrays.
For any data notation, drivers can be created that provide these abstract
data views (for example, a driver could be written for video that abstracts
it as a node list of frames or scenes, allowing the use of, for example,
list locations to locate portions of the video data).

HyTime also provides an "escape", the notation location, which simply
encapsulates some notation-specific address and associates it with a
defined notation.  A typical example would be to define a notation for the
graphic hot spot location notation used by your favorite browser and then
put that notation in notloc-form elements, e.g.:

\<!DOCTYPE SampDoc [
  \<!NOTATION IPFHotSpots PUBLIC "+//ISBN 0-933186::IBM//NOTATION IPF Hot
                                                         Spots//EN" >
  \<!ELEMENT IPFHotSpot - - (#PCDATA) >
  \<!ATTLIST  IPFHotSpot
       HyTime  NAME #FIXED notloc
       notation NOTATION "IPFHotSpots"
       ID          ID  #REQUIRED
       locsrc   IDREF #IMPLIED -- Ultimately data entity containing graphic --
   >
 \<!ENTITY abitmap SYSTEM "apicture.bmp" NDATA OS2bitmap >
]>
\<sampdoc>
 \<nameloc id=abitmap>\<nmlist nametype=entity>abitmap\</nmlist>\</nameloc>
 \<ipfhotspot locsrc=abitmap>ulx=100 uly=50 lrx=200 lry=25\</ipfhotspot>
\</sampdoc>

The question remains, however, how would you represent HyTime locations
within a CGM graphic?  I'm not sure what the right answer is.  One approach
would be to define the CGM notation itself as SGML and then just use
standard HyTime location addressing.  Another approach would be to adopt
HyQ as the notation for defining locations within CGM graphics, which
provides the addressing functionality of HyTime without the need to adopt
SGML syntax as well.  Finally, CGM could define a comparable notation
unique to CGM that expresses the same addressing methods, but in a syntax
more consistent with CGM encoding.

If the assumption that addressing from CGM graphics will be either to SGML
documents or other CGM graphics, then HyTime-conforming addressing is
clearly the natural choice.  If the intent is to enable addressing from CGM
graphics to any other data representation, then HyTime-conforming
addressing is still the natural choice because notation locations provide
that ability.

\<Address HyTime=bibloc>
W. Eliot Kimber (kimber@passage.com) Systems Analyst and HyTime Consultant
Passage Systems, Inc., 9971 Quail Blvd., Suite 903, Austin TX 78758 (512)339-1400
465 Fairchild Dr., Suite 201, Mountain View, CA  94043, (415) 390-0911
\</Address>
</message>
<message id="<DMEGGINS.94May25070719@aix1.uottawa.ca>" date="2978852839">
Newsgroups: comp.text.sgml
Date: 25 May 1994 11:07:19 UT
From: David Megginson \<dmeggins@aix1.uottawa.ca>
Organization: Department of English, University of Ottawa
Message-ID: \<DMEGGINS.94May25070719@aix1.uottawa.ca>
References: \<Cq7Hzu.GAC@csc.ti.com> <19940523.2389@naggum.no> <9405241301.AA41169@source.asset.com> \<CqB87D.GKt@csc.ti.com> <9405241524.AA63728@source.asset.com>
Subject: Re: Embedding LaTeX in HTML

[Len Bullard]

|   Why not try:
|
|   \<xenoproc id="xp1" notation=latex>
|   \\[ e^{i\\pi} + 1 = 0 \\]
|   \</xenoproc>

Or better yet,

  \<xenoproc id="xp1" notation=latex>
  \<![CDATA[$e^{i\\pi} + 1 = 0$\]]>
  \</xenoproc>

David
--
David Megginson                Department of English, University of Ottawa,
dmeggins@aix1.uottawa.ca       Ottawa, Ontario, CANADA  K1N 6N5
dmeggins@acadvm1.uottawa.ca    Phone: +1 613 564 6850 (Office)
ak117@freenet.carleton.ca             +1 613 564 9175 (FAX)
</message>
<message id="<31255.smithn@orvb.saic.com>" date="2978858449">
Newsgroups: comp.text.sgml
Date: 25 May 1994 12:40:49 UT
From: "Norman E. Smith" \<smithn@orvb.saic.com>
Message-ID: <31255.smithn@orvb.saic.com>
Subject: EASYDTD Version 2 available

I uploaded EASYDTD Version 2 to ftp.ifi.uio.no.  The new file resides in
/pub/SGML/Demo/easydtd2.zip.  The Zip file contains MS-DOS executable, C
source code, and documentation.  EASYDTD generates a DTD from the list of
tags in outline form.  The biggest single upgrade in this version is that
content model connectors and occurance indicators can be specified for each
element.  Version 1 inserted default values.  The following is a short
description of EASYDTD operation and a list of my other SGML utilities also
available.

I still have a list of additional enhancements in mind for EASYDTD that
I'll make as time permits.  Updates to the other utilities need to be done
before I spend much additional time on EASYDTD.

Norm

==================================================================

EASYDTD V2
----------
One of the most complex parts of SGML is the DTD.  Writing a DTD
is a tedious task at best.  I have always believed that there
must be a simpler way to express the information that an SGML
application must know to process a document.  My answer is EASYDTD.
EASYDTD uses a simple outline form of data elements to generate
a DTD.  For example:

document
  fm
    title
    author
    date
  body
    chapter
      section
        subsection
          para
          list
            le
  bm
    index

Generates a skeleton DTD.  This is the simplest input.  The DTD generated
from this file will need to be edited (primarily the content model),
but the work of getting all syntax correct is already done.  Entities
and Attributes are also handled.  The content model can be specified.


SENGINE
-------
NOTE: I am planning a major upgrade to S-Engine this summer.

S-Engine is an SGML document interpreter.  It has a macro language
in which the user specifies actions to perform when each tag is
encountered in the input stream.  The .APP files are example source
files.  The SENGINE.EXE program uses the compile form in the .BIN
files.  S-Engine can read/write files and can be embedded into other
applications to process SGML files.  Other uses for S-Engine include
converting SGML files from one tag set to another.  The program
assumes the input SGML file is valid.  The compiler portion of
S-Engine is based on the book "Write Your Own Programming Language
Using C++" and the SGML interpreter is based on "Writing Output
Drivers Using Borland C++".  The next major upgrade is planned for
the summer of 1994.


SVIEWER
-------
S-Viewer is an SGML document viewer built on top of S-Engine.  It's
macro language is a superset of the one in S-Engine.  The compiler
and interpreter are identical in the two programs.  The windowing
package is Mix C/Windows Toolbox, which restricts S-Viewer to
DOS-based systems for the present.

"Write Your Own Programming Language Using C++", ISBN 1-55622-264-5.
"Programming Output Drivers Using Borland C++", ISBN 1-55622-309-9.
Each book is $15 including source code on floppy disk.  Both books
are published by and available from:

    Wordware Publishing, Inc.
    1506 Capital Ave.
    Plano, Texas, 74075

    Phone: 214-423-0090

RAST2SGM
--------
NOTE: I originally intended to add Attribute processing and some
other enhancements, but now I use OmniMark for this type of program.
I'll put it back on my list of things to update if anyone cares,
otherwise this is it for the immediate future for RAST2SGM....

This program converts the output from RAST back to SGML form.  RAST is
the utility included with SGMLS that converts SGMLS output into a form
closely matching SGML; I forget the official description of what RAST
does.

The idea is to start with a minimized SGML file, parse it with SGMLS,
run the SGMLS output through RAST, then the RAST output through RAST2SGM
and end up with a 'normalized' SGML file (with end tags expanded).

Depending on your DTD and depending on whether you use some advanced
features of SGML, RAST2SGM may or may not generate a parsable SGML file.
However, it is a useful way to expand end tags for other SGML
applications that require them.

==================================================================

These SGML tools are available at FTP.IFI.UIO.NO in pub/SGML/Demo.

Norman Smith
Internet:   smithn@orvb.saic.com
Compu$erve: 72745,1566

--
Norman E. Smith, CDP               | Internet:   smithn@orvb.saic.com
Science Applications International | Compu$erve: 72745,1566
P.O. Box 2501                      | Ma Bell:    615-481-2186
Oak Ridge Tn. 37830                |
</message>
<message id="<2DE3CBC2@noak.vxo.telub.se>" date="2978858640">
Newsgroups: comp.text.sgml
Date: 25 May 1994 12:44:00 UT
From: Peter Bergstrom \<pebe@telub.se>
Message-ID: <2DE3CBC2@noak.vxo.telub.se>
Subject: Entity set for sub and superscript?

I would like to know how others have handled superscript and subscript in
text.  I know the old MIL-M-28001 have two elements for this, and most
other DTD's I have seen use the same technique, or simly ignore the fact
that text must sometimes be superscripted or subscripted.

What I'm looking at now is to define entity sets for the most common
expressions using super or subscript, such as U\<sub>eff\</sub>,
m\<sup>2\</sup>.  Those entities are supposed to be used in descriptive and
procedural text and tables, but not in mathematical formulas (where other
solutions might be preferred).  Some simple examples:

    The voltage U\<sub>eff\</sub> should be 5V.
    would be:
    The voltage \&Ueff; should be 5V.

    The area is 5 m\<sup>2\</sup>.
    would be:
    The area is 5 \&sqmtrs;.

Do you think it is even possible to define such an entity set, if the
application is military documentation for operation and maintenance?

Have anybody already done something that I could use or 'steal' from?

Is there any other ways to get around using specific elements for these
types of constructions?


If I succeed in defining those entity sets it will be posted in the near
future.

Thanks!
Peter
--
Peter Bergstr\&ouml;m           internet:  pebe@telub.se
Telub Inforum AB               phone:     +46 470 425 97
351 80  V\&Auml;XJ\&Ouml;        fax:       +46 470 397 96
SWEDEN
</message>
<message id="<9405251305.AA77382@source.asset.com>" date="2978859952">
Newsgroups: comp.text.sgml
Date: 25 May 1994 13:05:52 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405251305.AA77382@source.asset.com>
Subject: Re: Embedding LaTeX in HTML

[David Megginson]

|   Or better yet,
|
|     \<xenoproc id="xp1" notation=latex>
|     \<![CDATA[$e^{i\\pi} + 1 = 0$\]]>
|     \</xenoproc>

That's interesting, but more advanced than my middling skill.  What does it
do?  Is the intent to prevent a syntax collision by declaring the section
as CDATA here rather than in the DTD?

Len
</message>
<message id="<9405251317.AA38076@source.asset.com>" date="2978860645">
Newsgroups: comp.text.sgml
Date: 25 May 1994 13:17:25 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405251317.AA38076@source.asset.com>
References: \<Cq7Hzu.GAC@csc.ti.com> <19940523.2389@naggum.no> <9405241301.AA41169@source.asset.com> \<CqB87D.GKt@csc.ti.com> <9405241524.AA63728@source.asset.com> \<DMEGGINS.94May25070719@aix1.uottawa.ca>
Subject: Re: Embedding LaTeX in HTML

[David Megginson]

|   Or better yet,
|
|     \<xenoproc id="xp1" notation=latex>
|     \<![CDATA[$e^{i\\pi} + 1 = 0$\]]>
|     \</xenoproc>

Ah, I see.  Had to get out the Bryan book for a moment to recognize a
marked section.  Never had a system that used that feature and I tend to
remember only those features which I use.  Great.

A question to the HyTime cognoscenti.  David's example seems to be
completely applicable, however, in the examples for HyTime xenoforms,
PCDATA is used as the content of a xenoform.  would there be a particular
reason, in a HyTime context, to prefer the marked section or not?  I
understand the role of the marked section to avoid problems with resolving
markup by treating the the content as character data.  Should a HyTime
object do the same, and if so, why is this not mentioned in 10744 or any of
the published examples?

Len Bullard
</message>
<message id="<kimber.6.0008626A@passage.com>" date="2978860980">
Newsgroups: comp.text.sgml
Date: 25 May 1994 13:23:00 UT
From: "W. Eliot Kimber" \<kimber@passage.com>
Organization: Passage Systems, Inc.
Message-ID: \<kimber.6.0008626A@passage.com>
References: \<Cq7Hzu.GAC@csc.ti.com> <19940523.2389@naggum.no> <9405241301.AA41169@source.asset.com> \<CqB87D.GKt@csc.ti.com> <9405241524.AA63728@source.asset.com> \<DMEGGINS.94May25070719@aix1.uottawa.ca> <9405251317.AA38076@source.asset.com>
Subject: Re: Embedding LaTeX in HTML

[Len Bullard]

|   A question to the HyTime cognoscenti.  David's example seems to be
|   completely applicable, however, in the examples for HyTime xenoforms,
|   PCDATA is used as the content of a xenoform.  would there be a
|   particular reason, in a HyTime context, to prefer the marked section or
|   not?  I understand the role of the marked section to avoid problems
|   with resolving markup by treating the the content as character data.
|   Should a HyTime object do the same, and if so, why is this not
|   mentioned in 10744 or any of the published examples?

It's not a HyTime issue -- it's purely an SGML Parsing issue, addressed
either in a DTD or in a document instance.  If the content of a particular
element might be interpreted as SGML markup, you have three choices:

1.  Give the containing element a declared content of CDATA or RCDATA.
    *THIS IS NOT RECOMMENDED*

    The reason it's not recommended is that the element content will be
    ended by the first valid end tag open encountered (e.g., "</" in the
    reference concrete syntax).  This is obviously not robust.  It also
    makes it impossible to apply ad-hoc parsing against such a document
    (parsing where the parser doesn't know or care about content models but
    only scans delimiters and does pattern matching) because you only know
    to ignore markup if you know what the content rule for the element is.

2.  Use a CDATA marked section, as shown in David's example.  This is a
    much better solution, since it is more robust, being ended only by a
    marked section close ("\]]>" in the reference concrete syntax).  Also,
    since the content marking is explicit, ad-hoc parsers can handle it.

3.  Put the data in a data entity.  This is the most robust solution, as
    there will never be any parsing problems.  It also allows the use of
    notations and notation attributes to further describe the data object.

Note that by mentioning ad-hoc parsing of SGML I'm not necessarily
suggesting you *should* use ad-hoc parsing of SGML.  I'm just pointing out
that by making certain choices in your element declarations and feature
usage, you can enable reliable parsing of SGML documents by less than
full-featured parsers, and even, if you're a gambler and have complete
control over your instances, simple Perl pattern matching scripts (if
(s/(\<P>)//) { print "It's a paragraph\\n"}).

--
\<Address HyTime=bibloc>
W. Eliot Kimber (kimber@passage.com) Systems Analyst and HyTime Consultant
Passage Systems, Inc., 9971 Quail Blvd., Suite 903, Austin TX 78758 (512)339-1400
465 Fairchild Dr., Suite 201, Mountain View, CA  94043, (415) 390-0911
\</Address>
</message>
<message id="<2rvqct$pt6@panix2.panix.com>" date="2978868061">
Newsgroups: comp.text.sgml
Date: 25 May 1994 15:21:01 UT
From: Shawn Mitchell \<smitchel@panix.com>
Organization: PANIX Public Access Internet and Unix, NYC
Message-ID: <2rvqct$pt6@panix2.panix.com>
Subject: Seeking MS-Windows SGML viewer object

I am looking for a MS-Windows window object for viewing SGML files.
Preferably C++ with source and no run-time fee. :)  I apologize if this is
inappropriate for this group.  Thanks in advance.

--Shawn
</message>
<message id="<9405252345.AA3100@notes.notes>" date="2978872889">
Newsgroups: comp.text.sgml,comp.text,comp.lang.misc,comp.lang.icon
Date: 25 May 1994 16:41:29 UT
From: Ken Holman \<gkholman@msl.isis.org>
Message-ID: <9405252345.AA3100@notes.notes>
References: <1994May17.140242.27932@tin.monsanto.com> \<CpyDI0.M9M@world.std.com> \<CqBJ8u.B6s@unx.sas.com> <1994May24.212757.11431@midway.uchicago.edu>
Subject: Re: Spitbol reference?

[Gary Merrill]

|   It is one of my favorite languages, though one hardly sees it used much
|   any more at all.

[John Smith]

|   I like Snobol

I, personally, hated using Snobol but was biased because I only had to
learn it and know it for two weeks to do a single assignment regarding
pattern matching in a languages course in 1978 and haven't needed to touch
it since.

What features of the language particularly helped you or made you feel that
the language was more useful than others?

The only thing I remember from Snobol was that there was no such thing as a
syntax error.  My recollection is that anything and everything that could
be typed on a line _meant_ something.  There were no bounds to help with
identifying typos, improperly typed constructs, etc.

Very frustrating when an assignment had to be done in short order.

--
G. Ken Holman                 | Phone:  +1 613 727-5696  x317
Vice President, R\&D           | Fax:    +1 613 727-9491
Microstar Software Ltd.       | US:      1 800 267-9975
34 Colonnade Road North       | BBS:    +1 613 727-5272
Nepean Ontario CANADA K2E-7J6 | E-mail: gkholman@msl.isis.org
</message>
<message id="<1994May25.183650.22336@midway.uchicago.edu>" date="2978879810">
Newsgroups: comp.text.sgml
Date: 25 May 1994 18:36:50 UT
From: Burt Totaro \<totaro@math.uchicago.edu>
Organization: Dept. of Mathematics
Message-ID: <1994May25.183650.22336@midway.uchicago.edu>
Subject: How to prepare an SGML manuscript?

I'm thinking of writing a mathematics book, and I want to code it in SGML.
Some of the material already exists as Latex files; some will be original
and coded directly into SGML.  Given that I have access to a Sun computer,
is ArborText the best software package to use for such heavily mathematical
copy?  I feel that there must be something better.  What else is out there?

Burt Totaro

--
Burt Totaro
University of Chicago
</message>
<message id="<769891591snz@tommy.demon.co.uk>" date="2978880391">
Newsgroups: comp.text.sgml
Date: 25 May 1994 18:46:31 UT
From: John Nissen \<jn@tommy.demon.co.uk>
Organization: Technology for Object Management
Message-ID: <769891591snz@tommy.demon.co.uk>
References: <2r5r14INN5sc@smokies.cis.ohio-state.edu>
Subject: Re: Databases and/or Object Stores for SGML and/or HTML Docum 

[William Bradley]

|   I am looking for information on projects (commercial or academic) which
|   revolve around the storage of SGML and/or HTML documents in databases
|   or object stores.
|   
|   Any information would be greatly appreciated.

Hello Bill,

Somebody on this group did mention databases, adding that that they were
expensive.  There does seem to be a real need for something at reasonable
cost.  I'm interested in using SGML (and or HTML) to structure documents
for blind people, and intend to develop an object store.  Does anybody have
a tool which would help me to extract the structure of an SGML document so
I can put it in the store as a structure of objects?  I'll also need the
converse ability -- take a structure in the store and present it as an SGML
document.

John

--
  John Nissen      |    Technology     |    Tel: +44 81 742 3170
  Thames Bank      |       for         |    Fax: +44 81 742 8715
  Chiswick Mall    |      Object       |    Email: jn@tommy.demon.co.uk
  London W4 2PR    |    Management     |    Consultancy: Lapcrest Ltd
</message>
<message id="<2s0cug$cbl@usenet.rpi.edu>" date="2978887056">
Newsgroups: comp.text.sgml,comp.text
Date: 25 May 1994 20:37:36 UT
From: Marc Edgar \<edgarm@rpi.edu>
Organization: Rensselaer Polytechnic Institute, Troy, NY.
Message-ID: <2s0cug$cbl@usenet.rpi.edu>
Subject: PageMaker to HTML Converter

Greetings,

I hope I am not to taking up too much bandwidth to ask if there is a way to
convert PageMaker documents to HTML?

Is it possible to convert Frame docs using the Frame Interchange Format?
That is, PageMaker -> RTF -> HTML?

I have checked out the page
http://info.cern.ch/hypertext/WWW/Tools/Word_proc_filters.html
and did not a Frame to HTML converter.

Thanks in advance.
Marc Edgar
edgarm@rpi.edu
</message>
<message id="<2s0o8c$b6l@crl.crl.com>" date="2978898636">
Newsgroups: comp.text.sgml
Date: 25 May 1994 23:50:36 UT
From: Blueberry Software \<filtrix@crl.com>
Message-ID: <2s0o8c$b6l@crl.crl.com>
Keywords: ATA DTD
Summary: seeking SGML documents which utilize the complete ATA DTD.
Subject: seeking ATA compliant documents

Our company would like to obtain example documents which comply with the
ATA's DTD.  We are hoping to find some which utilize all or nearly all of
the elements in the ATA DTD.

Would some of you kind folks point us to an ftp server which has such a
thing?

OR

If you have such a document would you please share it with us?

Please mail to
filtrix@crl.com.

Thanks
</message>
<message id="<2s2232INN33c@oasys.dt.navy.mil>" date="2978941474">
Newsgroups: comp.text.sgml,comp.text
Date: 26 May 1994 11:44:34 UT
From: Betty Harvey \<harvey@oasys.dt.navy.mil>
Organization: Carderock Division, NSWC, Bethesda, MD
Message-ID: <2s2232INN33c@oasys.dt.navy.mil>
References: <2s0cug$cbl@usenet.rpi.edu>
Subject: Re: PageMaker to HTML Converter

[Marc Edgar]

|   I hope I am not to taking up too much bandwidth to ask if there is a
|   way to convert PageMaker documents to HTML?
|   
|   Is it possible to convert Frame docs using the Frame Interchange
|   Format?  That is, PageMaker -> RTF -> HTML?

I haven't tried it but you would have to convert it either to Word Perfect,
MSWord or RTF.  I have only converted PageMaker to Word Perfect and I
wasn't really happy with the conversion of the data.  RTF might be a better
option, especially since there is a RTF2HTML utility.

I did find a WP2HTML WordPerfect macro, but haven't tested it yet.

|   I have checked out the page
|   http://info.cern.ch/hypertext/WWW/Tools/Word_proc_filters.html
|   and did not a Frame to HTML converter.

You can find this on FTP at src.doc.ic.ac.uk under
/computing/information-systems/www/tool/translators I think the
subdirectory is called frame2html.  I think this is the subdirectory but I
know it is in this ftp repository.

Hope this helps.

--
Betty Harvey  \<harvey@oasys.dt.navy.mil>     | David Taylor Model Basin
Advanced Information Systems Branch          | Carderock Division
Code 183                                     | Naval Surface Warfare
Bethesda, Md.  20084-5000                    |   Center
(301)227-3348   FAX (301)227-3343            | DTMB,CD,NSWC
</message>
<message id="<CqEtM6.I89@unx.sas.com>" date="2978944494">
Newsgroups: comp.text.sgml,comp.text,comp.lang.misc,comp.lang.icon
Date: 26 May 1994 12:34:54 UT
From: Gary Merrill \<sasghm@theseus.unx.sas.com>
Organization: SAS Institute Inc.
Message-ID: \<CqEtM6.I89@unx.sas.com>
References: <1994May17.140242.27932@tin.monsanto.com> \<CpyDI0.M9M@world.std.com> \<CqBJ8u.B6s@unx.sas.com> <1994May24.212757.11431@midway.uchicago.edu> <9405252345.AA3100@notes.notes>
Subject: Re: Spitbol reference?

[G. Ken Holman]

|   What features of the language particularly helped you or made you feel
|   that the language was more useful than others?

Oh dear!  I don't see how anyone who has actually used the language for
anything significant can ask this.  Here are a few significant features:

   1. String patterns as a fundamental part of the language; full and
      partial pattern matching; conditional and unconditional assignment as
      a result of matching or failure to match.

   2. Ability to define patterns recursively.  This is an incredibly
      powerful feature.

   3. User defined data types.

|   The only thing I remember from Snobol was that there was no such thing
|   as a syntax error.  My recollection is that anything and everything
|   that could be typed on a line _meant_ something.  There were no bounds
|   to help with identifying typos, improperly typed constructs, etc.

Don't be such a wimp.  You probably don't like LISP either.

|   Very frustrating when an assignment had to be done in short order.

This, of course, is a student's perspective.  You are excused for having
the attitude you do.  SNOBOL (and LISP, and APL) are *very* different from
"normal" programming languages.  Many (most?)  programmers never come to
understand or like them.  Since I made my living (such as it was) as a
logician for about a decade, I have a fundamental affinity to SNOBOL and
LISP.  From *my* point of view, their conceptual foundations are *natural*
and those of "normal" (procedural) languages are a bit "odd".  (I can still
say this after more than 10 years of writing C and C++ compilers.)

As a formal programming language, as you point out, SNOBOL is pretty
appalling in various ways -- and you didn't even mention the virtual lack
of decent control structures (let's not count the computed "goto" among
these, shall we?).  But it is (was?)  a powerful language.  I once wrote a
derivation checking program (including I/O, which is another great topic in
SNOBOL) for sentential logic in about 200 lines of SNOBOL (including the
definitions of data types and recursive patterns).  The same program in
PL/1 was over 1200 lines -- and to my mind somewhat less clear than the
SNOBOL version.

But look ...  Quite honestly, my remarks about the use of SNOBOL and
"elderly text processing tools" were sarcastic.  The fact that in a given
context people are driven to the heavy and ad hoc use of such things should
tell us something about that context.  I love SNOBOL for what it is -- just
as I can admire a nineteenth century set of builder's tools for what they
are.  But I wouldn't use century old tools to build my house (unless -- as
academics are inclined to proceed -- I was interested more in the
experience of wielding the tools than in the quality of the finished
product).  If I really had a task suited to SNOBOL, I would probably use
ICON (which I think of, and I thing *Griswold* thinks of as "what SNOBOL
should have been").  But I might regard even this as a prototype and write
the production version in C or C++.

--
Gary H. Merrill  [Principal Systems Developer, Compiler and Tools Division]
SAS Institute Inc. / SAS Campus Dr. / Cary, NC  27513 / (919) 677-8000
sasghm@theseus.unx.sas.com ... !mcnc!sas!sasghm
</message>
<message id="<2s2fla$mv9@altair.herts.ac.uk>" date="2978955370">
Newsgroups: comp.text.sgml
Date: 26 May 1994 15:36:10 UT
From: Gilbert Wu \<erdqgsw@herts.ac.uk>
Organization: University of Hertfordshire
Message-ID: <2s2fla$mv9@altair.herts.ac.uk>
Subject: HyTime Treeloc

HyTime Tree Location

We are having problems in understanding how a treeloc with multiple source
locations is to be resolved.  The problems are as follows:

1. ISO10744:8.4.2.3(p.60) paragrah 2 states:

   "The attribute list form treecom can be used with a multiple location
   source to view the node list as a tree."  ...

   Assuming that we set the treecom attribute to treecom.  Does this mean
   that the first element in the list becomes the root, the second element
   becomes the child and the third element becomes the grandchild and so
   on?  Or does this mean that a new root node is introduced above the list
   and all elements become children of this root?

2. As an example, we have the following structure:

	     \<chapter id=e1>
		     \<section>
			     \<para>\</para>
			     \<para>\</para>
			     \<para>\</para>
			     \<para>\</para>
		     \</section>
		     \<section>
			     \<para>\</para>
			     \<para>\</para>
			     \<para>\</para>
			     \<para>\</para>
		     \</section>
	     \</chapter>

	     \<chapter id=e2>
		     \<section>
			     \<para>\</para>
			     \<para>\</para>
			     \<para>\</para>
		     \</section>
		     \<section>
			     \<para>\</para>
			     \<para>\</para>
			     \<para>\</para>
		     \</section>
	     \</chapter>

    Some tree location instances are defined using the about 2 chapters as
    location sources:

    1)	\<treeloc locsrc="e1 e2" treecom = treecom >
	     \<marklist>1 2 2\</marklist>
	\</treeloc>

	Does treeloc point to: para 2, section 2 of chapter e1 or section 2
	of chapter e2 or something else?

    2)	\<treeloc locsrc="e1 e2" treecom = treecom >
	     \<marklist>1 3 1\</marklist>
	\</treeloc>

	Does treeloc point to: section 1 of chapter e2 or nothing?

    3)	\<treeloc locsrc="e1 e2" treecom = ntreecom >
	     \<marklist>1 2 1\</marklist>
	\</treeloc>

	para 1, section 2 of chapter e1?

We welcome any suggestion.

Regards

Gilbert Wu, Steve Baird
ERDC
Univ of Herts.
U.K.
</message>
<message id="<9405261653.AA12456@mercury>" date="2978960024">
Newsgroups: comp.text.sgml
Date: 26 May 1994 16:53:44 UT
From: Mary Holstege \<holstege@mercury.kset.com>
Message-ID: <9405261653.AA12456@mercury>
References: <1994May17.140242.27932@tin.monsanto.com> \<CpyDI0.M9M@world.std.com> \<CqBJ8u.B6s@unx.sas.com> <1994May24.212757.11431@midway.uchicago.edu> <9405252345.AA3100@notes.notes>
Subject: Re: Spitbol reference?

[Ken Holman]

|   I, personally, hated using Snobol but was biased because I only had to
|   learn it and know it for two weeks to do a single assignment regarding
|   pattern matching in a languages course in 1978 and haven't needed to
|   touch it since.
|
|   What features of the language particularly helped you or made you feel
|   that the language was more useful than others?
|
|   The only thing I remember from Snobol was that there was no such thing
|   as a syntax error.  My recollection is that anything and everything
|   that could be typed on a line _meant_ something.  There were no bounds
|   to help with identifying typos, improperly typed constructs, etc.

The nice thing about SNOBOL is that if you are used to thinking of grammars
as the proper way to express parsing problems, then any time you have a
parsing problem, SNOBOL let's you express the parser as a pretty
conventional-looking grammar.  Whipping up (grammar) rule-based systems can
be accomplished with lightning speed in SNOBOL.

Granted it has its deficiencies, some of which you point out, but I still
find that nothing beats it for simple parsing problems.  For me, awk stands
for awkward.  What has happened to SNOBOL is that the people who invented
it fell in love with C and look SNOBOL, mixed it up with C, and produced
ICON, which is itself a very nice language.  It no longer has the simple
grammar-pattern look of SNOBOL, but flow control etc., is a lot better.
(FYI: You can get it from icon-project@cs.arizona.edu; anonymous FTP to the
same site.)

It's a pity more folks aren't trained in grammar-building or in thinking
about grammars in a grammatical fashion.  For example, I reckon a lot of
the frustration people have trying to make DTDs is that they are not used
to writing grammars, and that is fundamentally what a DTD is.  I also think
it's a shame that the inventors of SGML were not a bit more inclined
towards thinking grammatically, or some of the rough edges of SGML wouldn't
exist.

For instance, the big gaping hole in SGML that everyone keeps falling into
is that there is no way *within SGML proper* to express the concrete syntax
of the document language being specified in the DTD.  This is why SGML is
not self-descriptive and needs the kludgey-looking add-on for specifying
its own concrete syntax.  It also means people keep falling into the trap
of trying to specify concrete syntax relations using abstract syntax
elements and playing games with SHORTTAG and because that is what SGML
gives you.

If I may climb on my soapbox a little bit here:

To completely specify a document for portability, you need to specify its
contents plus its grammar.  The grammar defines the language to which the
document belongs.  What is the "grammar"?  It is the abstract syntax
(element to component relations plus element in class relations), plus the
concrete syntax (abstract syntax to abstract geometry relations), plus the
abstract geometry (visual element to element relations, visual class
relations, etc.).

SGML is fine for specifying abstract syntax (although it is missing a
couple of basic distinctions that would make life easier), but it leaves
concrete syntax and abstract geometry alone, which is too bad.  Right now
we're in the situation of everyone running around madly trying to come up
with adjuncts to SGML to accomplish this, and no single standard to use
with the SGML abstract syntax specification, and a big mess.

Axe grinding alert: I developed a complete model for document specification
in my thesis.  In background I tinker around with trying to figure out how
to layer the concrete syntax and abstract geometry parts of it onto SGML
abstract syntax.  There are, alas, technical difficulties in using the SGML
abstract syntax model.

                -- Mary
                   Holstege@kset.com

--
Mary Holstege, Sr. Member of Technical Staff
KnowledgeSet Corporation
555 Ellis Street                    Tel: (415) 254-5452
Mountain View, CA 94043             FAX: (415) 254-5451
</message>
<message id="<2s2mms$bn4@brachio.zrz.TU-Berlin.DE>" date="2978962588">
Newsgroups: comp.text.sgml
Date: 26 May 1994 17:36:28 UT
From: Matthias Butt \<mbut0135@w203zrz.zrz.tu-berlin.de>
Message-ID: <2s2mms$bn4@brachio.zrz.TU-Berlin.DE>
Subject: Multi language documents - how to?

I am looking for a principled solution for dealing with multilingual
documentation (if there is any).

I can imagine two approaches:

    * keep the language versions in one instance
    * keep the language versions in separate instances

I would like to start from a "monolingual" DTD.  All support for multiple
languages should be by simple addition to that DTD.  I would like to be
able to arbitrarily fix the level of language-to-language correspondence,
i.e., I would like to be able to choose whether I require a paragraph-
by-paragraph correspondence or only a much looser section-by-section
correspondence.

I would also like to have the choice to make the correspondence required or
optional, not only for the entire document but individually on each level
which allows for a correspondence.

Furthermore the whole thing should be flexible enough to allow as many
languages as I want.

Any ideas or concepts out there?
</message>
<message id="<2s2vo0$eu3@lastactionhero.rs.itd.umich.edu>" date="2978971840">
Newsgroups: comp.text.sgml
Date: 26 May 1994 20:10:40 UT
From: Eric Ederer \<ewederer@umich.edu>
Organization: SILS-University of Michigan
Message-ID: <2s2vo0$eu3@lastactionhero.rs.itd.umich.edu>
Subject: Archive Practic and SGML

I am a graduate student at University of Michigan School of Library and
Information Science.  Myself and Linda Ritz (another graduate student) are
doing research in SGML and archival practice.  We are interested in how
SGML formatted documents could be or are being used in the creation and
maintence of electronic archives.

While we are interested in all aspects of this subject, we are especially
interested in how SGML is being used to help authenticate and organize
electronic archives.  We would also be interested in how SGML has helped
with the appraisal of electronic document collections.

For those of you who are not archivists, I would like to give a brief
explanation of what archivists mean by authentication, organization, and
appraisal.

The archival science of authentication is called Diplomatics.  Diplomatics
evolved out of the practice of medieval scribes attempting to authenticate
handwritten manuscripts of the court.  They examined the paper, the ink,
handwriting, and format of the paper to see if it was authentic.  This
science has continued to evolve.  Linda and I are working on updating this
science with electronic records.  The suggestion for this line of inquiry
comes in part from the archivist David Bearman of the University of
Pittsburgh.

When archivists decide to keep a collection, they try to preserve it
according to its original order.  They attempt to keep the files how the
creating organization first ordered the documents.  This helps people
within the organization finds the documents, and the ordering helps to
reveal the original structure and nature of the organization.  This is
critical metadata for archivists.

Finally, we are interested in how SGML has helped in the appraisal aspects.
Appraisal is the process by which archivists decide which record to keep.
While there are many theories of appraisal, here are three archival values
in selection.  First, how important is the item to the organization
maintaining the archive.  Second, what is the informational value of the
item to the organization.  Thirdly, what is historical, cultural, or
scientific research value of the document.

We would also welcome comments regarding Hytime as well.

Please send responses by e-mail to myself.  Linda and I will post a summary
of our findings on this news group later.

We look forward to hearing from anyone doing work in this area.
</message>
<message id="<truly.108.000DB612@lunemere.com>" date="2978973756">
Newsgroups: comp.text.sgml
Date: 26 May 1994 20:42:36 UT
From: Truly Donovan \<truly@lunemere.com>
Organization: La Lunemere
Message-ID: \<truly.108.000DB612@lunemere.com>
References: <2s2mms$bn4@brachio.zrz.TU-Berlin.DE>
Subject: Re: Multi language documents - how to?

[Matthias Butt]

|   I am looking for a principled solution for dealing with multilingual
|   documentation (if there is any).
|
|   I can imagine two approaches:
|
|	* keep the language versions in one instance
|	* keep the language versions in separate instances
|
|   I would like to start from a "monolingual" DTD.  All support for
|   multiple languages should be by simple addition to that DTD.  I would
|   like to be able to arbitrarily fix the level of language-to-language
|   correspondence, i.e., I would like to be able to choose whether I
|   require a paragraph-by-paragraph correspondence or only a much looser
|   section-by-section correspondence.
|
|   I would also like to have the choice to make the correspondence
|   required or optional, not only for the entire document but individually
|   on each level which allows for a correspondence.
|
|   Furthermore the whole thing should be flexible enough to allow as many
|   languages as I want.
|
|   Any ideas or concepts out there?

The declaration of language itself is of course an attribute of the highest
level document element where a language declaration is allowed.  I infer
that you are primarily concerned with "simultaneous translation" documents
rather than documents that have incidental inclusions of material in
another language.

I don't think there is any reasonable way to enforce structural
correspondence through the DTD (except for the addition of language that
allows the user to declare the required level of correspondence).  Good
editing tools are the best bet for creating correspondence, and perhaps for
validation as well.

I'm not sure that it matters a whole lot how many "instances" you have, as
long as each language version is a separate entity.  If for no other
reason, this facilitates the work of authors and translators.  Then
multiple scaffolds can be constructed to pull together or process
separately whatever presentation instances might be needed.

The challenges here lie primarily in meeting the many different user
requirements for output presentation -- source correspondence is one thing,
presentation synchronization is quite another.  The last time I looked at
this problem in detail, the user requirements were all over the place, so
if you are really looking for a general-purpose solution, you will have
your work cut out for you.

Truly Donovan
publishing consultant

</message>
<message id="<9405262058.AA92279@source.asset.com>" date="2978974734">
Newsgroups: comp.text.sgml
Date: 26 May 1994 20:58:54 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405262058.AA92279@source.asset.com>
References: <1994May17.140242.27932@tin.monsanto.com> \<CpyDI0.M9M@world.std.com> \<CqBJ8u.B6s@unx.sas.com> <1994May24.212757.11431@midway.uchicago.edu> <9405252345.AA3100@notes.notes> <9405261653.AA12456@mercury>
Subject: Re: Spitbol reference?

I am among the unwashed untutored in the ways of abstract grammars.  Also
may be suffering from use of overloaded terms.  Have never seen references
to abstract geometry in this context.  Have seen mandelbrot's use of
geomorphology but don't think that is what you had in mind.

As you seem to be well versed in all of this, could you annotate your
posting so I can understand it, perhaps with some examples, or please give
a list of references by which I may decipher it?

Would really like to understand the issues you are raising if only a
little?

"A little mind is a sad thing to waste; but it's all I have."

Len bullard
</message>
<message id="<truly.109.001024CD@lunemere.com>" date="2978982512">
Newsgroups: comp.text.sgml
Date: 26 May 1994 23:08:32 UT
From: Truly Donovan \<truly@lunemere.com>
Message-ID: \<truly.109.001024CD@lunemere.com>
Subject: Multi-language documents correction

I haven't learned yet how to go fix a posting that has already gone by, so
bear with me, please.  I said in an earlier posting that the language
attribute went on the highest level element where you would allow a
language shift; the brain was disengaged, temporarily we hope.  It goes on
*any* level where a language shift is allowed.

Truly Donovan
</message>
<message id="<kimber.8.00133A7E@passage.com>" date="2978986415">
Newsgroups: comp.text.sgml
Date: 27 May 1994 00:13:35 UT
From: Eliot Kimber \<kimber@passage.com>
Organization: Passage Systems, Inc.
Message-ID: \<kimber.8.00133A7E@passage.com>
References: <2s2mms$bn4@brachio.zrz.TU-Berlin.DE>
Subject: Re: Multi language documents - how to?

[Matthias Butt]

|   I am looking for a principled solution for dealing with multilingual
|   documentation (if there is any).
|   
|   I can imagine two approaches:
|   
|       * keep the language versions in one instance
|       * keep the language versions in separate instances

For the IBMIDDoc DTD, we developed the general concept of "retrieval
alternatives" for elements, where for a given "base" element, you could
address one or more alternate elements.  The primary motivation for this
function was to enable the creation of multi-language documents where you
want to have different language versions of a given element related to each
other unambiguously.  However, the function is not limited to language-
based differentiation, but can be used for any number of such purposes.

We also developed the concept of "object libraries", which are simply
containing elements whose semantic is that the elements they contain are
only used by reference.  This provides an alternative organizing method to
entities that allows you to group elements that are intended to be reused
into common containers.  Re-use of these elements is accomplished by any
number of means, including via HyTime content location (conloc), as a
retrieval alternative, or via some other reference, such as explicit query
or application-defined association (for example, glossary entries held in
an object library are used by associating them to instances of DefinedTerm
elements in the content portion of the document).

While our initial design for retrieval alternatives always made the
association by ID reference, we later took a page out of the IETM book and
defined an element that serves to contain elements together as alternates
of a single "logical" element.

The various alternative forms of a given element are distinguished by
comparing their descriptive properties (e.g., language, version, hardware
platform, user type, etc.).  For example, to have multi-language versions
of a given object, you might have something like this:

    \<!DOCTYPE IBMIDDOC PUBLIC "+//ISBN 0-933186::IBM//DTD IBMIDDOC//EN">
    \<ibmiddoc language="English">\<!-- Base language is English -->
    \<prolog>
      ...
    \<objectlibrary>
     \<p id=p1j language="Japanese">This is the Japanese version\</p>
     \<p id=p1esp language="Spanish">This is the Spanish version\</p>
    \</objectlibrary>
    ...
    \<div>Test of Retrieval Alts
    \<p id=p1base retrievalalts="p1j p1esp">
    This is the base (English) version\</p>
    \</div>
    ...
    \</ibmiddoc>

The base paragraph has now been unambiguously associated to the other
versions, and each version is distinguished by its language property (the
base paragraph inherits its language property from its ancestor, in this
case the IBMIDDoc element itself).  A processing program should have no
difficulty doing whatever it is you want it to do: print a specific
language version, produce a side-by-side document, arrange editing windows
to show correspondence, etc.

Note also that since IBMIDDoc is a HyTime application, you can use any
supported HyTime addressing methods to locate alternatives in other
documents.  For example, you might have documents that consist of nothing
but object libraries for specific language versions of various objects,
e.g.:

    \<!DOCTYPE ObjectLibrary PUBLIC "+//ISBN 0-933186::IBM//DTD IBMIDDOC//EN">
    \<objectlibrary language="Japanese">
     \<desc>This object library contains re-usable objects translated into
    Japanese\</desc>
     \<p id=p1j>...\</p>
     \<p id=p2j>...\</p>
    \</Objectlibrary>

To locate these paragraphs, you'd use a named location-form element, e.g.:

    \<nameloc id=p1j-loc>
      \<nmlist docorsub=JapaneseObjects nametype=element>p1j\</nmlist>
    \</nameloc>
    \<p id=p1base retrievalalts="p1j-loc p1esp">...

Assuming that the above ObjectLibrary entity has been declared with the
name JapaneseObjects.

In IBMIDDoc, this facility can be used with any element, but you could
easily restrict its use to specific elements by only providing the
RetrievalAlts attributes for specific elements or by only allowing certain
elements within a RetrievalAlts element (which is what the IETM DTDs do).

--
\<Address HyTime=bibloc>
W. Eliot Kimber (kimber@passage.com) Systems Analyst and HyTime Consultant
Passage Systems, Inc., 9971 Quail Blvd., Suite 903, Austin TX 78758 (512)339-1400
465 Fairchild Dr., Suite 201, Mountain View, CA  94043, (415) 390-0911
\</Address>
</message>
<message id="<kimber.9.001676F0@passage.com>" date="2978998063">
Newsgroups: comp.text.sgml
Date: 27 May 1994 03:27:43 UT
From: "W. Eliot Kimber" \<kimber@passage.com>
Organization: Passage Systems, Inc.
Message-ID: \<kimber.9.001676F0@passage.com>
References: <2s2fhg$loa@altair.herts.ac.uk>
Subject: Re: HyTime Treeloc

[Gilbert Wu]

|   HyTime Tree Location
|
|   We are having problems in understanding how a treeloc with multiple
|   source locations is to be resolved. The problems are as follows:
|
|   1. ISO10744:8.4.2.3(p.60) paragrah 2 states:
|
|      "The attribute list form treecom can be used with a multiple
|      location source to view the node list as a tree."  ...
|
|      Assuming that we set the treecom attribute to treecom.  Does this
|      mean that the first element in the list becomes the root, the second
|      element becomes the child and the third element becomes the
|      grandchild and so on?  Or does this mean that a new root node is
|      introduced above the list and all elements become children of this
|      root?

I had the same question myself just yesterday as I was making sure the
section on Treeloc of my forthcoming HyTime book was correct.  I had never
given much thought to treecom before this, but found this question a real
stumper until Neil Kipp of TechnoTeacher pointed out to me the what should
have been the obvious key fact: The first number in a tree location is the
axis marker for the node in the node list that is the root of the tree.
This is why you must specify a "1" for the first number in a treeloc when
the tree is a singly-rooted tree: you are specifying the first node in a
one-node node list.

Thus, given a multiply-rooted tree (a "forest" in HyTime terms), a treeloc
starting "1 ..." applies to the tree rooted at the first node in the root
node list, "2 ..." applies to the second tree, and so on.

Once this was pointed out to me, it all became clear and rational.  A
closer reading of the description of treeloc reveals these key sentences
(italics mine):

"The addressable range of the root level (*unless there are multiple
roots*) is a single quantum.  *It is addressed by a single axis marker that
locates a node within it.*"

Neil suggested that treeloc would be better named "forestloc" since it
really defines a location within a single tree within a forest of trees.
Viewed this way, multiply-rooted trees (forests) make perfect sense and all
treeloc confusion evaporates.

This also makes listloc make sense.  Section 8.4.2.6 says:

    "A listloc acts like a pathloc whose location source is a single-level
    tree.  The second dimspec of each pair is omitted, but is taken to be
    "1 1".

If I've understood this correctly, this should probably say "a listloc acts
like a pathloc whose location source is a single-level *multiply-rooted*
tree."

In other words, a pathloc can represent a forest, and a single-level forest
is a node list where each node is the root of each tree in the forest.

Consider this simple tree:

                                                     A
                                                   /     \\
                                                 B      C
                                                /  \\     /  \\
                                               D  E  F  G

The pathloc matrix for the whole tree is:

          A A A A
          B B C C
          D E F G

However, if we make nodes B and C the location source for a path location,
the pathloc matrix would be:

            B B C C
            D E F G

Node B is the root of the first tree of the forest, node C is the second.
Using the same location source for a treeloc, node G would be located at "2
2".  Node C is the second node of the root nodelist, node G is the second
node of the next level's node list.

Finally, to get back to treecom, when the treecom value is "treecom", then
the location source defines a multiply-rooted tree and a location returns a
single node in the case of treeloc.  However, when the treecom value is
"ntreecom", then each member of a multiple location source is taken
*individually* as the root of a tree, and the location is applied to each
tree in turn.  Thus, given the same location source above, but with a
treecom value of "ntreecom" for the treeloc, the treeloc "1 2" would be
applied first to the tree rooted at B, returning the node "E", and then
against the tree rooted at C, returning the node "G", resulting in the
returned node list "E G".

This could be very handy.  For example, say you wanted the titles of all
the sections in a document, and you know that the title-bearing element is
always the first child of the section.  You could locate those titles like
so:

  \<!-- Assume sections are at third level in doc: Doc-body-section -->

  \<!-- Define location source that locates all sections in document: -->
  \<pathloc id=all-sections-loc set=set>1 -1 3 1\</pathloc>

  \<!-- Now locate the first child of each section: -->
  \<treeloc treecom=ntreecom
    locsrc="all-sections-loc">1 1\</treeloc>

The result of this treeloc will be a node list containing the first child
of each object located by the path location.  Pretty handy.  This could
also be expressed with the following HyQ query:

  treeloc(pathloc(1 -1 3 1 set) 1 1 ntreecom)

--
\<Address HyTime=bibloc>
W. Eliot Kimber (kimber@passage.com) Systems Analyst and HyTime Consultant
Passage Systems, Inc., 9971 Quail Blvd., Suite 903, Austin TX 78758 (512)339-1400
465 Fairchild Dr., Suite 201, Mountain View, CA  94043, (415) 390-0911
\</Address>
</message>
<message id="<2s51ot$6qb@oclc.org>" date="2979039453">
Newsgroups: comp.text.sgml
Date: 27 May 1994 14:57:33 UT
From: Keith Shafer \<shafer@oclc.org>
Organization: OCLC Online Computer Library Center, Inc.
Message-ID: <2s51ot$6qb@oclc.org>
Keywords: Automatic DTD Creation
Subject: Chicken and Egg DTD Problem

[Norman E. Smith]

|   EASYDTD V2
|   ----------
|   One of the most complex parts of SGML is the DTD.  Writing a DTD is a
|   tedious task at best.  I have always believed that there must be a
|   simpler way to express the information that an SGML application must
|   know to process a document.  My answer is EASYDTD.  EASYDTD uses a
|   simple outline form of data elements to generate a DTD.

I agree!!!  SGML for the masses seems to be severely limited by the chicken
and egg problem of DTD existence/use.  We often encounter this problem at
OCLC when building free-text, SGML-like databases where the input records
are created by outside sources.

In many such cases, we can assume that the documents are fairly well marked
up, but there is no initial DTD describing the documents.  So, we can't
build a database of the documents - what structure do you index, what
structural points do you make available, etc.  While you can do it by hand,
it very time consuming!

To solve this problem, we've created a suite of tools that allows you to
build a DTD by example.  That is, the system can take up marked text,
extract the tags from it and build a reduced DTD.  This approach differs
from EASYDTD in that you can build/reduce the DTD by example instead of
outlining the structure.  Both approaches are very useful, depending on a
given instance of the problem.

Unfortunately our work is currently not in the public domain so I can't
point you to an FTP site to get the code to check it out.  Nor is
particularly useful to describe the gory details here of how to extract
tags without a DTD or reduce the definition to something useful.

While I could also go into detail on all the tools we created to run off
these processes (DTD editor, a logical database description editor, and
interface editors), I'd rather move that stuff off-line and stick to a more
general discussion as to the need, benefit, or irrelevance of such tools to
the SGML community.

I would be quite interested in hearing about other instances where data
collections, large or small, could have been managed by SGML tools had an
automated DTD creation tool also been available.  I would also like to
communicate with others (directly where appropriate) who have found a need
for such tools or would be interested in furthering this research area.

Keith.
--
Keith Shafer, PhD              | 6565 Frantz Road   |
Senior Research Scientist      | Dublin, Ohio 43017 | shafer@oclc.org
Online Computer Library Center | 614-761-5049       |
</message>
<message id="<TPT.94May27161755@hpcerpa.ce.hp.com>" date="2979044275">
Newsgroups: comp.text.sgml,comp.text
Date: 27 May 1994 16:17:55 UT
From: Tim Temple \<tpt@hpcerpa.ce.hp.com>
Organization: Hewlett Packard, Software Initiative, 1501 Page Mill Road 5M-R, Palo Alto, CA 94304
Message-ID: \<TPT.94May27161755@hpcerpa.ce.hp.com>
References: <2s0cug$cbl@usenet.rpi.edu>
Subject: Re: PageMaker to HTML Converter

[Marc Edgar]

|   Is it possible to convert Frame docs using the Frame Interchange
|   Format?  That is, PageMaker -> RTF -> HTML?

If you're starting from Frame, there are a couple of frame -> html
converters: fm2html and WebMaker.

    fm2html: ftp bang.nta.no:pub/fm2html.tar.v.0.n.m.Z.
    WebMaker: http://www1.cern.ch/WebMaker/WEBMAKER.html
      (which shows how to get the package)

Hope this helps.

tt
--
Disclaimer: these remarks are personal and do not constitute and
official response by Hewlett-Packard, Company.  Use at your own risk.
</message>
<message id="<9405271730.AA24717@source.asset.com>" date="2979048636">
Newsgroups: comp.text.sgml
Date: 27 May 1994 17:30:36 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405271730.AA24717@source.asset.com>
References: <2s51ot$6qb@oclc.org>
Subject: Re: Chicken and Egg DTD Problem

1.  What is about DTD creation that is tedious or difficult?

2.  If instance is marked up, how was the original markup derived and
    applied?

3.  How do tools to derive DTDs by example create inclusion/exclusion
    mechanisms?

4.  How do these tools account for character entity substitution which
    improves portability of marked up data?

I agree, tag extraction is easy.  Deciding what to *name* the tag is not
always simple.  Please explain the process by which one decides this, and
if post processes are applied to the initial derivation in order to provide
for more complex subsequent processing that marks up the same data for
multiple functional uses other than living inside the paint loop.

If it didn't come from an egg, it ain't no chicken.  But it can be a fish.
It seems you are working very hard to make chickens out of fish.  Once the
fish becomes fowl is it still fair?  Sorry, couldn't resist the bait. |-)
</message>
<message id="<9405271736.AA24761@source.asset.com>" date="2979048981">
Newsgroups: comp.text.sgml
Date: 27 May 1994 17:36:21 UT
From: Len Bullard \<bullardc@source.asset.com>
Message-ID: <9405271736.AA24761@source.asset.com>
References: <9405231805.AA62856@source.asset.com> <2rtdbq$hjj@nrcnet0.nrc.ca>
Subject: Re: ISO 10744

Thanks to all of you for posting the addresses for getting the ISO
information.  Actually, I have this document and posted the query at the
request of the US Army Missile Command folks who for some reason, didn't
want to do it themselves, proving once again that the civil servants would
perish without the contractors...and vice versa.
</message>
<message id="<JOLLIE.94May27125223@grant.cs.uiowa.edu>" date="2979049942">
Newsgroups: comp.text.sgml
Date: 27 May 1994 17:52:22 UT
From: "Jeffrey C. Ollie" \<jollie@grant.cs.uiowa.edu>
Organization: The University of Iowa, Iowa City, Iowa, USA
Message-ID: \<JOLLIE.94May27125223@grant.cs.uiowa.edu>
Subject: Is the FAQ being actively maintained?

Is the FAQ being actively maintained?  The version on ftp.ifi.uio.no is
over one year old.

--
Jeffrey C. Ollie
jeffrey-ollie@uiowa.edu
http://caesar.cs.uiowa.edu/~jollie/
</message>
<message id="<etxsvkl.770063986@garboa02>" date="2979052786">
Newsgroups: comp.text.sgml
Date: 27 May 1994 18:39:46 UT
From: Svante Kleist \<etxsvkl@garbo.ericsson.se>
Organization: Ericsson
Message-ID: \<etxsvkl.770063986@garboa02>
Subject: SGML editor for Japaneese?

Does anybody know about an SGML editor capable of handling Japanese text?
/Svante

--
Svante Kleist, Ericsson Telecom AB, Systems and Software - Design Support.
"Any correspondence between opinions, ideas or other thoughts expressed by me
and those held by my employer is usually nothing but a pure coincidence!"
</message>
<message id="<19940527.2529@naggum.no>" date="2979061733">
Newsgroups: comp.text.sgml
Date: 27 May 1994 21:08:53 UT
From: Erik Naggum \<erik@naggum.no>
Organization: Naggum Software; +47 2295 0313
Message-ID: <19940527.2529@naggum.no>
References: \<JOLLIE.94May27125223@grant.cs.uiowa.edu>
Subject: Re: Is the FAQ being actively maintained?

[Jeffrey C. Ollie]

|   Is the FAQ being actively maintained?  The version on ftp.ifi.uio.no is
|   over one year old.

the FAQ.readme file reads as follows:

    FAQ.0.0 is reinstated, with the change made 1992-01-15.

    The FAQ will be replaced by a four-part introduction, containing an
    introduction to the newsgroup, to the archive, and to other services in
    relation to comp.text.sgml (see the file "general-info"); an introduction
    to SGML; a list of standards related to SGML (see the file "standards"; by
    Heather Davenport); and a bibliography (see the file "bibliography"; by
    Robin Cover).

    FAQ.0.0 has been retained because it was very popular among users.  I have
    never been particularly happy with it, but I let it hang around.  Please
    send suggestions by e-mail.

the files referred to are as follows:

modes       size       last read            last write      filename
rw-mu      18877  1994-05-27 16:45:49  1993-04-30 01:03:19  FAQ.0.0
rw-mu        779  1994-05-27 20:30:39  1994-04-13 05:23:43  FAQ.readme
rw-mu     167974  1994-05-27 11:05:57  1992-01-20 22:50:08  bibliography
rw-mu      10767  1994-05-27 16:52:49  1994-02-26 13:46:59  general-info
rw-mu      18847  1994-05-27 11:05:57  1994-05-23 21:03:31  standards

(all times UT).

no, the FAQ is not actively being maintained.  however, the comp.text.sgml
archive is, and will continue to receive primary attention.  when it takes
about two hours a day just to keep the archive up and the mailing list
going, I don't feel any strong obligation to repeat myself in an FAQ.  the
archive has it all.  this does not preclude continuous updates, it just
means that I can't devote more time on this than I already do.  however,
nothing in the FAQ as it stands is _wrong_, and that's more than you can
say about any other "FAQ" that's supposedly addressing SGML.

it's ironic, but the only really frequently asked question on this
newsgroup is: "where's the FAQ?"  that only tells me people want to read
something and not ask intelligent questions they would like it to answer.
some have even gone so far as to be _annoyed_ that the FAQ doesn't answer
their unspoken questions, which they also refuse to disclose when asked.

I'd like to write something useful that people can read to understand SGML,
but it's a book-length project, and I'm devoting my time to that, instead.

my suggestion is to ask for whatever information you seek, not ask for an
FAQ to answer questions you don't want to ask for fear they are "frequently
asked".  in this sense, I think FAQ's are a disservice to the network
community, as it stops activity short by a hostile comment that whatever is
a "frequently asked questions", no matter whether the answer in the FAQ was
good enough or not.  _simple_ frequently asked questions are annoying, but
"frequently" doesn't mean once or twice in three months, and there haven't
been too many simple questions to this newsgroup at all -- apart from those
"where's the FAQ?" questions.
there are hundreds of competent people on this newsgroup who will eagerly
answer any question you might care to ask, but you have to ask it first.
as with most newsgroups, requests for specific help is often mailed to the
individuals seeking information, not posted.  questions about more general
aspects of SGML will be answered by me or Eliot Kimber.

posting a specific question on a newsgroups makes people aware that
somebody else had the same problem, and it is not uncommon to receive a
bunch of messages asking to share the answer with them.  USENET has always
worked like this, and the trend towards more mail is growing as the news
volume picks up.

so ask your questions without fear of posing a "frequently asked question"
and annoying people!  and fear not that you'll get flamed for asking stupid
questions.  only wrong _answers_ get flamed here.

now, what would you like the FAQ to tell you that it doesn't?

best regards,
\</Erik>
--
Erik Naggum \<erik@naggum.no> \<SGML@ifi.uio.no>  |  memento, terrigena.
ISO 8879 SGML, ISO 10744 HyTime, ISO 10646 UCS  |  memento, vita brevis.
</message>
<message id="<1994May27.143401.1077@vaxb.phx1.aro.allied.com>" date="2979063241">
Newsgroups: comp.text.sgml
Date: 27 May 1994 21:34:01 UT
From: Vic Fragnito \<b98429@vaxb.phx1.aro.allied.com>
Organization: AlliedSignal Engines
Message-ID: <1994May27.143401.1077@vaxb.phx1.aro.allied.com>
Subject: Looking for SGML User Group Chapter

Could someone point me in the direction of a SGML users group in Arizona,
preferably in Phoenix.  In a previous posting, there was a mention to an
Arizona chapter of an SGML users group without a contact point.  I would
appreciate information on how to contact them.

--
Vic Fragnito  (602) 231-1213
AlliedSignal Engines
Phoenix, Arizona
</message>
<message id="<CqInI8.CnI@demon.co.uk>" date="2979123200">
Newsgroups: comp.text.sgml
Date: 28 May 1994 14:13:20 UT
From: David MacDonald \<david@folio.demon.co.uk>
Message-ID: \<CqInI8.CnI@demon.co.uk>
References: <1994May24.213549.12011@midway.uchicago.edu>
Subject: Re: SGML editors

[Richard L. Goerwitz]

|   ...

I'm interested in any answers to Richard's question also.  (In particular,
are there any SGML editors/tools that are based on the Macintosh operating
system?

--
David MacDonald          david@folio.demon.co.uk
Perth   Scotland
</message>
<message id="<MAGNUS.94May28184736@haukugle.ii.uib.no>" date="2979132456">
Newsgroups: comp.text.sgml
Date: 28 May 1994 16:47:36 UT
From: Magnus Y Alvestad \<magnus@haukugle.ii.uib.no>
Organization: Department of Informatics
Message-ID: \<MAGNUS.94May28184736@haukugle.ii.uib.no>
References: <1994May24.213549.12011@midway.uchicago.edu> \<CqInI8.CnI@demon.co.uk>
Subject: Re: SGML editors

An SGML-aware 'editor' for the Mac:

    Author/Editor.

-Magnus
</message>
<message id="<2s8bs3$h3p@cmcl2.NYU.EDU>" date="2979148099">
Newsgroups: comp.text.sgml,comp.text,comp.lang.misc,comp.lang.icon
Date: 28 May 1994 21:08:19 UT
From: Olivier Lefevre \<lefevre@ACFcluster.NYU.EDU>
Organization: New York University, NY, NY
Message-ID: <2s8bs3$h3p@cmcl2.NYU.EDU>
References: <1994May17.140242.27932@tin.monsanto.com> \<CpyDI0.M9M@world.std.com> \<CqBJ8u.B6s@unx.sas.com> <1994May24.212757.11431@midway.uchicago.edu> <9405252345.AA3100@notes.notes> \<CqEtM6.I89@unx.sas.com>
Subject: Re: Spitbol reference?

[Gary Merrill]

|   ... SNOBOL (and LISP, and APL) are *very* different from "normal"
|   programming languages.  Many (most?)  programmers never come to
|   understand or like them.

I hope I may be forgiven for jumping into this thread.  I was weaned on
FORTRAN and recently moved to APL2 (which I do not find confusing at all,
BTW, nor restricted to writing prototypes) and I am developing an interest
in the "zoology" of programming languages.  Are there any good references
on this topic, in the comparative-anatomy-of-programming-languages style?
I've seen a few on the bookshelves of the local Barnes & Nobles store but I
would appreciate some informed advice.

Thanks beforehand

Olivier Lefevre
Hospital for Joint Diseases, NYU Medical School, NYU
</message>
<message id="<2scbur$anu@krakatoa.iconix.oz.au>" date="2979279259">
Newsgroups: comp.text.sgml
Date: 30 May 1994 09:34:19 UT
From: Stuart McCormack \<smac@iconix.oz.au>
Organization: Iconix Pty Ltd (World Headquarters)
Message-ID: <2scbur$anu@krakatoa.iconix.oz.au>
References: \<JOLLIE.94May27125223@grant.cs.uiowa.edu> <19940527.2529@naggum.no>
Subject: Rules for an SGML FAQ

[Erik Naggum]

|   my suggestion is to ask for whatever information you seek, not ask for
|   an FAQ to answer questions you don't want to ask for fear they are
|   "frequently asked".  in this sense, I think FAQ's are a disservice to
|   the network community, as it stops activity short by a hostile comment
|   that whatever is a "frequently asked questions", no matter whether the
|   answer in the FAQ was good enough or not.  _simple_ frequently asked
|   questions are annoying, but "frequently" doesn't mean once or twice in
|   three months, and there haven't been too many simple questions to this
|   newsgroup at all -- apart from those "where's the FAQ?" questions.

I'ts interesting to note that "Where's the FAQ?" is such an unintelligent
and unimaginative question that, really, the dweebs who would stoop to
asking have no right to be answered except in a supercilious and
egotistical manner.

Why, we ask, would an ignorant who knows nothing about SGML except that it
exists be so gutless as to want to read an FAQ which might, in a nutshell,
explain precisely what SGML is, what it's for, where it's going, and a few
other meaningless bits of trivial information that would be of absolutely
no use whatever in framing intelligent questions?  God forbid we should
ever post the thing.

And because we personally are so intimate in all aspects of SGML lore we
see know reason why a two year old version that you can't find is more than
sufficient for your puny needs.  Instead we plan to write a magnificent
treatise explaining in pedantic detail all those things you ignorami always
wanted to know but were too gutless or stupid to ask.  It's sure to be a
best seller.

You lucky punters must be quivering with desire!

For the time being, however, we plan to continue with our two golden rules,
being:

1.  Always look down your nose at anyone who knows less than you, and

2.  Never give the customer what they ask for.  (We know better, after
    all.)

We spend two hours a day maintaining the precious archive that you'd know
all about if we could be bothered posting an FAQ that described it, so
don't get shitty with us, you nerds.

	The Management
</message>
<message id="<19940530.2562@naggum.no>" date="2979290096">
Newsgroups: comp.text.sgml
Date: 30 May 1994 12:34:56 UT
From: Erik Naggum \<erik@naggum.no>
Organization: Naggum Software; +47 2295 0313
Message-ID: <19940530.2562@naggum.no>
References: \<JOLLIE.94May27125223@grant.cs.uiowa.edu> <19940527.2529@naggum.no> <2scbur$anu@krakatoa.iconix.oz.au>
Subject: Re: Rules for an SGML FAQ

[Stuart McCormack]

|   For the time being, however, we plan to continue with our two golden
|   rules, being:
|
|   1)  Always look down your nose at anyone who knows less than you, and
|
|   2)  Never give the customer what they ask for.  (We know better, after
|       all.)
|
|   We spend two hours a day maintaining the precious archive that you'd
|   know all about if we could be bothered posting an FAQ that described
|   it, so don't get shitty with us, you nerds.

thank you for your generous support of the SGML archive.  it is for users
like you I do all this, and it is of course the enormous emotional rewards
I get from your postings that continue to drive me in my quest to keep the
whole thing up to date and useful.

thank you also for asking an intelligent and useful question.  your help in
building a base on which an FAQ can be written to answer frequently asked
questions is strongly appreciated.  without your postings, an FAQ would be
certain to contain much less useful information.

I understand that you're a complete beginner, and it is of course hard for
you to realize that of the 4500 articles in the archive, you will find at
least 500 from me, and at least 450 of these contains information that
others before you have found "useful" to "very useful".  those who think so
are people who have asked intelligent and specific questions, as opposed to
have demanded to get some service in this joint.  mostly, of course, this
is because you don't look for the information when pointers to it are
published right in front of your nose.  if you had cared to, you would have
found the information you wanted in the article you seem to have a very
hard time actually reading.  instead, you, like countless others, continue
to behave like world-class losers.  it is for the whining losers that FAQs
were first written, specifically so they wouldn't destroy the peace and
quiet the experienced users would like to have for themselves.  I do not
subscribe to this view of the network users, but you give me a hard time,
Stuart McCormack.  I used to think that losers could not possibly be worth
writing for, but hey! after reading your article, I think maybe I should
write an excellent 50K FAQ for your specific needs.  you're inspirational,
you know that?

I also understand that newbies on the Net think other people there exist
for their pleasure, use and exploitation.  this is a development which I,
personally, think is quite alarming.  it means that people are prepared to
log in to _obtain_ information, but refuse to give anything useful back.
in the more extreme cases, people do not only expect information to be
there for their perusal, they _demand_ that it be.  I'm sorry, but this is
not how the Net works.  it works because _some_ people _give_ information
to others.  your task if you have a question is to make those people want
to give you the information.  nobody here is "The Management", nobody pays
for any single informative article published.  it is _all_ volunteer-based.
this also means nobody is a "customer" here.  the currency in this shop is
the goodwill among professionals.

if you don't like this, you really don't belong in a volunteer-run world.
you should pay for the information, instead, like a customer.  customers
who don't pay are usually shown the door.  so, you should put your money
where your mouth is and try to make this a commercially viable undertaking.
such as paying enormous sums of money for an SGML course on the syntax of
SGML, and nobody to ask questions afterwards.  such as the usual one-way
information you find in expensive and infrequently published magazines and
newsletters.  is this how you want USENET to be run?  do you know what you
have, and what you're asking for?

but I have a compromise solution for you: if you (Iconix Pty Ltd) put up
USD 10,000, I can devote a month to writing an FAQ, and maintaining it for
one year afterwards, to be available for anyone on the Net for free.  what
does the customer say?  too expensive?  well, sorry, but we don't run a
charity in this shop.  I've put in more than 2,500 hours in this archive
and the newsgroup to help people after I got my own kick-start from the
Net.  your 10 grand comes down to minimum wage for this time.  as customers
go, I don't think you can get it any cheaper than this.

please note that there is no SGML project at the U of Oslo which pays for
this archive, nor my time.  unlike the U of Exeter, this is not a funded
project (and even it has very unfortunately had its funding cut).  I have
asked the SGML Users' Group to cover my networking costs, but have so far
heard nothing.  it seems that if you do something for free long enough,
people will think you _should_ do it for free, and will refuse to pay you.
since the information is available for free, everybody thinks they can take
it, and nobody wants to pay for it.  obviously, this cannot work forever.
it is with some bitterness I note that the number of demands for free
information now outnumber the thank-you notes 5 to 1, and those I ask to
help defray the cost never respond.

best regards,
\</Erik>
--
Erik Naggum \<erik@naggum.no> \<SGML@ifi.uio.no>  |  memento, terrigena.
ISO 8879 SGML, ISO 10744 HyTime, ISO 10646 UCS  |  memento, vita brevis.
</message>
<message id="<2sdtcl$3s9@news.delphi.com>" date="2979302024">
Newsgroups: comp.text.sgml
Date: 30 May 1994 15:53:44 UT
From: Jeffrey McArthur \<j_mcarthur@BIX.com>
Organization: Galahad
Message-ID: <2sdtcl$3s9@news.delphi.com>
Subject: Re: Rules for an SGML FAQ

Let me ask a simple question: How do I introduce SGML to to a novice,
someone who has never heard of SGML before?

If someone asks me about TeX, I can tell them to read the FAQ for
comp.text.tex.  If someone asks me the differences between Modula-2,
Oberon, and Oberon-2, those are listed in the FAQ for comp.lang.modula.2
and comp.lang.oberon.

If I want to introduce someone to literate programming, I can give them the
FAQ for comp.lang.literate and it provides a good introduction to the
subject.

Maintaining a FAQ is a complex and difficult task.  But it does not need to
be the work of only one person.  comp.text.tex handles the problem is a
rather unique way.  One person sat down and created an outline of all the
major highlight.  Then asked for contributions for each section.  There are
still major section which read something like: "waiting for someone who is
willing to write a section on this".  But then comp.text.tex covers such a
vast area.  SGML is small by comparison.  For example the number of
implementations of parser for SGML for the Atari ST is quite small (0, I
beleive).  There are only 2 or 3 implementations of TeX for the Atari ST.
About the same number for the Amiga.

Telling the user to go search the archive makes one major assumption: the
person has access to the internet.  With a FAQ it is possible to provide a
single file to someone to get them started.

For SGML to continue to florish, new users need to be introduced to the
subject.  If SGML is even PRECEIVED as yet-another-propritary-format,
people will not want to move to SGML.

I have kicked around the internet for several years now.  If I find a
newsgroup that looks interresting I will start reading it.  Most newsgroups
post the FAQ about once a month.  If after two months of reading I have not
seen an FAQ I would ask where I could obtain one.  If I had done that with
this group, I now see the greeting I would have received.

All I can say is: what a wonder way to introduce new users to SGML.

--
    Jeffrey M\\kern-.05em\\raise.5ex\\hbox{\\b c}\\kern-.05emArthur
    a.k.a. Jeffrey McArthur          ATLIS Publishing
    phone: (301) 210-6655            12001 Indian Creek Court
    fax:   (301) 210-4999            Beltsville, MD  20705
    home:  (410) 290-6935            email: j_mcarthur@bix.com

The opinions express are mine and mine alone.  They do not reflect
the opinions of ATLIS Publishing or any part of ATLIS Systems.
ATLIS does NOT pay for my access to the Internet.
</message>
<message id="<CqMJ0D.3q6@unx.sas.com>" date="2979303997">
Newsgroups: comp.text.sgml,comp.text,comp.lang.misc,comp.lang.icon
Date: 30 May 1994 16:26:37 UT
From: Gary Merrill \<sasghm@theseus.unx.sas.com>
Organization: SAS Institute Inc.
Message-ID: \<CqMJ0D.3q6@unx.sas.com>
References: <1994May17.140242.27932@tin.monsanto.com> \<CpyDI0.M9M@world.std.com> \<CqBJ8u.B6s@unx.sas.com> <1994May24.212757.11431@midway.uchicago.edu> <9405252345.AA3100@notes.notes> \<CqEtM6.I89@unx.sas.com> <2s8bs3$h3p@cmcl2.NYU.EDU>
Subject: Re: Spitbol reference?

[Olivier Lefevre]

|   I hope I may be forgiven for jumping into this thread.  I was weaned on
|   FORTRAN and recently moved to APL2 (which I do not find confusing at
|   all, BTW, nor restricted to writing prototypes) and I am developing an
|   interest in the "zoology" of programming languages.  Are there any good
|   references on this topic, in the comparative-anatomy-of-programming-
|   languages style?  I've seen a few on the bookshelves of the local
|   Barnes & Nobles store but I would appreciate some informed advice.

I have not seen one that I felt was sufficiently detailed, but this is more
likely a comment on my own ignorance than anything else.  I can say that if
you ever get interested in LISP, you definitely should read _The Anatomy of
LISP_ by John Allen (McGraw-Hill).  This is perhaps my farorite book in the
are of programming languages.  It is interesting that in large part the
author intended the book as an introductory text in data structures.

--
Gary H. Merrill  [Principal Systems Developer, Compiler and Tools Division]
SAS Institute Inc. / SAS Campus Dr. / Cary, NC  27513 / (919) 677-8000
sasghm@theseus.unx.sas.com ... !mcnc!sas!sasghm
</message>
<message id="<9405301636.AA04726@ipsi.darmstadt.gmd.de>" date="2979304567">
Newsgroups: comp.text.sgml
Date: 30 May 1994 16:36:07 UT
From: Klemens Boehm \<kboehm@darmstadt.gmd.de>
Message-ID: <9405301636.AA04726@ipsi.darmstadt.gmd.de>
Subject: Sample SGML DTDs and Documents

Hello,

In order to test an SGML application we developed I am interested in sample
SGML documents and DTDs.  The structure of the DTD should be more
interesting, e.g., more complex, than, say, the HTML DTD.  I would also
need a fair amount of documents conformant to the DTD, and, finally, it
would help us a lot if the documents' content was not too special, i.e.,
understandable for an outsider.  Any hints are sincerely appreciated.

Thank you very much for your attention.

Best regards,

Klemens Boehm

--
$	Klemens Boehm							      $
$	GMD-IPSI			Phone: +49 6151 869 963		      $
$	Dolivostrasse 15		Fax:   +49 6151 869 818		      $
$	64293 Darmstadt			E-mail: kboehm@darmstadt.gmd.de	      $
$	Germany								      $
</message>
<message id="<2sda2oINNj5e@phage.cshl.org>" date="2979310104">
Newsgroups: comp.text.sgml,comp.text,comp.lang.misc,comp.lang.icon
Date: 30 May 1994 18:08:24 UT
From: William Chang \<wchang@phage.cshl.org>
Organization: Cold Spring Harbor Lab, NY
Message-ID: <2sda2oINNj5e@phage.cshl.org>
References: <1994May17.140242.27932@tin.monsanto.com> \<CqEtM6.I89@unx.sas.com> <2s8bs3$h3p@cmcl2.NYU.EDU> \<CqMJ0D.3q6@unx.sas.com>
Subject: Re: Spitbol reference?

[Olivier Lefevre]

|   I hope I may be forgiven for jumping into this thread.  I was weaned on
|   FORTRAN and recently moved to APL2 (which I do not find confusing at
|   all, BTW, nor restricted to writing prototypes) and I am developing an
|   interest in the "zoology" of programming languages.  Are there any good
|   references on this topic, in the comparative-anatomy-of-programming-
|   languages style?  I've seen a few on the bookshelves of the local
|   Barnes & Nobles store but I would appreciate some informed advice.

[Gary Merrill]

|   I have not seen one that I felt was sufficiently detailed, but this is
|   more likely a comment on my own ignorance than anything else.  I can
|   say that if you ever get interested in LISP, you definitely should read
|   _The Anatomy of LISP_ by John Allen (McGraw-Hill).  This is perhaps my
|   farorite book in the are of programming languages.  It is interesting
|   that in large part the author intended the book as an introductory text
|   in data structures.

I agree.  This book (the first one I ever bought), plus a handful of old
CACM papers on function closures (Baker), memory management, garbage
collection, etc, would do quite nicely.  I also found Griswold's _Macro
Implementation of SNOBOL_ to be interesting.  The best way to read about
these things is to browse an old, big library such as Courant Institute or
New York midtown library.  Or browse old MIT AI Memos (if you are there); a
lot of stuff on language "philosophies".  The journal _Software: Practice
and Experience_ is one of my favorite.

My favorite APL book is by Adian Smith, _APL: A Design Handbook for
Commercial Systems_, long out of print.  But a new edition is coming out
soon!

There's also Abelson & Sussman's _Structure and Interpretation of Computer
Programs_ (Scheme).

-- Bill Chang (wchang@cshl.org)
</message>
<message id="<OZ.94May30140713@ursa.sis.yorku.ca>" date="2979313633">
Newsgroups: comp.text.sgml,comp.text,comp.lang.misc,comp.lang.icon
Date: 30 May 1994 19:07:13 UT
From: "Ozan S. Yigit" \<oz@ursa.sis.yorku.ca>
Organization: The Electric Skillet
Message-ID: \<OZ.94May30140713@ursa.sis.yorku.ca>
References: <1994May17.140242.27932@tin.monsanto.com> \<CpyDI0.M9M@world.std.com> \<CqBJ8u.B6s@unx.sas.com> <1994May24.212757.11431@midway.uchicago.edu> <9405252345.AA3100@notes.notes> \<CqEtM6.I89@unx.sas.com> <2s8bs3$h3p@cmcl2.NYU.EDU> \<CqMJ0D.3q6@unx.sas.com>
Subject: Re: Spitbol reference?

[Gary Merrill]

|   I can say that if you ever get interested in LISP, you definitely
|   should read _The Anatomy of LISP_ by John Allen (McGraw-Hill).  This is
|   perhaps my farorite book in the are of programming languages.

I second the recommendation with a note: this book is a lot of fun and pays
off close study, but it is (sigh) dated.  Friedman/Wand/Haynes [1] book is
the closest thing to an up-to-date version.  [yeah, I like SICP too, but in
this case EOPL wins for detail and coverage.]

oz
---
[1] Daniel P. Friedman, Mitchell Wand and Christopher T. Haynes
    Essentials of Programming Languages
    MIT Press and McGraw-Hill
    1992
</message>
<message id="<Cqn0sJ.Ao7@demon.co.uk>" date="2979327042">
Newsgroups: comp.text.sgml
Date: 30 May 1994 22:50:42 UT
From: David MacDonald \<david@folio.demon.co.uk>
Organization: TMP
Message-ID: \<Cqn0sJ.Ao7@demon.co.uk>
References: \<JOLLIE.94May27125223@grant.cs.uiowa.edu> <19940530.2562@naggum.no> <2sdght$3i4@felix.dircon.co.uk>
Subject: Re: Is the FAQ being actively maintained?

[Stuart McCormack]

|   ...

Well, I will not bother to repeat the load of rubbish that Stuart McCormack
wrote.  Erik wrote an excellent reply which says it all - if Stuart wants
instant information then he should be prepared to pay for it.  Some people
seem to be 4 years old for life!  Grow up Stuart or shut up.

--
David MacDonald              david@folio.demon.co.uk
Perth       Scotland
</message>
<message id="<2979427003.1.p01292@psilink.com>" date="2979329910">
Newsgroups: comp.text.sgml
Date: 30 May 1994 23:38:30 UT
From: The Prescott Group \<p01292@psilink.com>
Organization: Search
Message-ID: <2979427003.1.p01292@psilink.com>
Subject: jcals/sgml/dtd mngmt oppty

Manager JCALS, SGML, DTD's

Call Pat +1 609 354 7651
</message>
<message id="<9405310805.AA15325@ditsydh.syd.dit.csiro.au>" date="2979360338">
Newsgroups: comp.text.sgml
Date: 31 May 1994 08:05:38 UT
From: Nick Carr \<carr@syd.dit.csiro.au>
Message-ID: <9405310805.AA15325@ditsydh.syd.dit.csiro.au>
Subject: Job Available

I am posting this for a colleague who does not have access to the Net.  If
responding by mail, please use this address.  If you are interested in
further details please fax +61 2 313 7518.  Due to starting requirements it
is essential that the applicant be an Australia citizen or hold a current
Australian work permit.

The position available is for an analyst/programmer with SGML and C++
experience.  Knowledge of DynaText and OmniMark would be extremely
desirable.  Remuneration will be commensurate with experience.

--
Nick Carr
Allette Systems
Sydney  NSW  Australia
</message>
<message id="<2sff4n$lne@oclc.org>" date="2979380823">
Newsgroups: comp.text.sgml
Date: 31 May 1994 13:47:03 UT
From: Keith Shafer \<shafer@oclc.org>
Organization: OCLC Online Computer Library Center, Inc.
Message-ID: <2sff4n$lne@oclc.org>
References: <2s51ot$6qb@oclc.org> <9405271730.AA24717@source.asset.com>
Subject: Re: Chicken and Egg DTD Problem

[Len Bullard]

|   1.  What is about DTD creation that is tedious or difficult?

We allow optional tagging to some degree.  Without a DTD, you must
determine where the optional end tags occur.  Also, we have found that some
tags need to be empty.  For instance, our Tandem documentation contains
things like \<process> where this is not a tag, but a variable.  So, the
extraction process has to be able to handle this as well.

I guess it's only "tedious" in that you have to allow different things like
this and provide hooks so our multiple interfaces built on top of the
engine allow the user to either set this kind of mismatch control once for
a given run, or interact with the engine on the fly when mismatches are
found.  Plus, without a DTD as a guide, how do you know what's optional,
etc?

The other catch is that once you have all the extracted tags and translate
them to a DTD, the DTD is often too "complex" for most SGML parsers.  (ie,
while it's grammatically correct, it doesn't meet the parsing limits set by
SGML parsers.)  So, we provide grammatical reduction to reduce the number
of rules and their complexity, hopefully without extending the intended
grammar.


|   2.  If instance is marked up, how was the original markup derived and
|       applied?

Some comes from simple translations to publisher tapes.  So, there's a
translation table of sorts, but the underlying structure may not be known.
(Often, this translation is NOT done by us.)  I know that this seems weird
at first, but we see it.  That is, the publisher says, yeah, this thing foo
can be tagged as \<bar>, but they don't really know what the context of foo
is in the original, so \<bar> could appear many places, at many points in
the structure.

As for authoring, novices can easily add some tagging to documents, like
\<author>, \<title>, etc, without having a specific structure in mind.  We
have seen this sort of thing in projects where raw textual information is
going to be translated, but there was no pre-defined structure.  By
allowing the ability to build the DTD by example, we were able to jump the
DTD discovery process.

|   3.  How do tools to derive DTDs by example create inclusion/exclusion
|       mechanisms?

Currently, they do not.  We have some ideas as to when you would like to
programmatically create these, but we are definitely open to suggestions.
In many cases, inclusion/exclusion can be viewed as a means of reducing the
complexity of a DTD by increasing its readability.  So, if we had some
agreement on some parameters for seeing this complexity, like, "this tag
occurs in 90% of the rules or appears three times in a given rule, make it
an inclusion", we could introduce inclusion.  I'm not sure then if you'd
mark the others as exclusions, or if inclusion/exclusion should be
discovered in other ways.  We are still looking into this and need more
samples/input to guide the decision.

|   4.  how do these tools account for character entity substitution which
|       improves portability of marked up data?

Currently, entity substitutions like this are ignored.  These definitions
have to be added by hand, as it were.  The created DTD has a catch-all
entity in it so as to still be "correct".  I should point out that the real
focus of our research is building databases, not deriving "perfect" DTD's,
since the database building processes we use (and the documents we see)
often do not take advantage of all the power of SGML.  For that matter, if
you don't have a DTD, you can't use the full power of SGML.  The problem we
have, is that we get fairly "clean" markup and need a structural guide for
a database build.

|   I agree, tag extraction is easy.  Deciding what to *name* the tag is
|   not always simple.  please explain the process by which one decides
|   this,

We do not introduce new tags into the text - or *name* them ourselves.
There are many ways to do this, and many companies that help do this.  We
are strictly interested in extracting the structure that's already there.
Once again, I that this is very counter to the traditional approach of
building documents after a DTD is determined.  However, when dealing with
previously created materials, this is not always possible.

|   It seems you are working very hard to make chickens out of fish.

One could argue that, especially if you haven't seen counter examples.
But, I have see some fish around here with wings ;)

From your prospective, is there no reason to build tools that aid in DTD
creation from marked up text?  Or, how do you/your company use SGML?

Keith.

(BTW, this is a copy of my response I sent directly to Len.  I didn't
realize he had posted this to the newsgroup 'til this morning - I was
receiving mostly personal responses to my article posting.)
</message>
<message id="<2sfv9h$3tq@dns1.NMSU.Edu>" date="2979397361">
Newsgroups: comp.text.sgml
Date: 31 May 1994 18:22:41 UT
From: Richard Chavez \<mfsrc@nmsu.edu>
Organization: New Mexico State University, Las Cruces, NM
Message-ID: <2sfv9h$3tq@dns1.NMSU.Edu>
Subject: GML to WordPerfect

Does anyone know of an translator for GML/DCF to
WordPerfect/Word?  Any info would be apprecitated.

--
Richard M. Chavez
mfsrc@nmsu.edu
CANTO Information Center
New Mexico State University
Las Cruces, N.M.
+1 505 646 1840
</message>
<message id="<199405312048.AA06251@naggum.no>" date="2979406062">
Newsgroups: comp.text.sgml
Date: 31 May 1994 20:47:42 UT
From: Jon Wise \<J_WISE@cheltenham-he.ac.uk>
Message-ID: <199405312048.AA06251@naggum.no>
Subject: SGML, HyTime and strong type

I am beginning some research into the use of strong typing within large
hypertext documents and am looking for some relevent discussion.  The
formality of SGML and HyTime appeals to my view that a rigorous approach to
computing will deliver the long-term benefits.

Thanks for any help you can provide.

--
Jon Wise
Senior Lecturer
Cheltenham and Gloucester College of Higher Education
England
</message>
<message id="<TED.94May31212214@lole.crl.nmsu.edu>" date="2979408134">
Newsgroups: comp.text.sgml
Date: 31 May 1994 21:22:14 UT
From: Ted Dunning \<ted@crl.nmsu.edu>
Organization: Computing Research Lab
Message-ID: \<TED.94May31212214@lole.crl.nmsu.edu>
References: <2s51ot$6qb@oclc.org> <9405271730.AA24717@source.asset.com> <2sff4n$lne@oclc.org>
Subject: Re: Chicken and Egg DTD Problem

here is a specific related to the general problem of legacy texts that we
have actually run into just recently.  as an SGML novice I would be very
interested in concrete suggestions which fit into our constraints.

[Keith Shafer]

|   [... talking about deriving dtd's after getting a bunch of text ...]
|
|   > It seems you are working very hard to make chickens out of fish.
|
|   One could argue that, especially if you haven't seen counter examples.
|   But, I have see some fish around here with wings ;)

I have been working on exactly this problem lately.  we have a plethora of
on-line resources with the following characteristics:

1) some are "SGML", some are not.  those that appear to be SGML generally
   don't have DTD's associated with them and those that are not are
   generally marked up to some degree.

2) new versions come out *all* the time.  this makes it a bit cumbersome to
   keep translating the documents to sgml.

3) some are so large that we have to keep them on the original CD-ROMS.
  this means that we have to deal with the original format somehow.

as a canonical and very simple example, we can take /usr/dict/words.  the
format is simple; the file consists of a list of words, each separated by
new-lines.  so, surely we can make up a DTD which just views this as a
highly minimized SGML file.

here is a short extract of the file:

10th
1st
2nd
3rd
4th
5th
6th
7th
8th
9th
a
AAA
AAAS
Aarhus
Aaron
AAU
ABA
Ababa
aback
abacus

if we use the following dtd with this file, sgmls marks this as a
conforming document.

\<!doctype dict [
\<!ELEMENT dict		O O   (word+)>
\<!ELEMENT word		O -   (#PCDATA)>

\<!ENTITY	wstart	 STARTTAG	"word" -- Word start-tag -->
\<!ENTITY	wend	 ENDTAG		"word" -- Word end-tag -->

\<!SHORTREF	dictmap  '&#RS;'	wstart -- Word start-tag --
                	 '&#RE;'	wend   -- Word end-tag -->

\<!USEMAP	dictmap	 dict>
]>

unfortunately, later in the file we find AT\&T, A\&P, IT\&T and so on.

so how can we deal with this?
</message>
<message id="<Pine.3.89-NWN.9405311400.11382S-0100000@mail.nwnet.net>" date="2979409749">
Newsgroups: comp.text.sgml
Date: 31 May 1994 21:49:09 UT
From: Laurence Lundblade \<lgl@nwnet.net>
Organization: A poorly-installed InterNetNews site
Message-ID: \<Pine.3.89-NWN.9405311400.11382S-0100000@mail.nwnet.net>
Subject: Re: Preprocessing, was: variable anchor/tags in HTML documents (fwd)

 [ I'm posting this here as well as to comp.infosystems.www since my news
   poster isn't doing cross-posting too well at the moment.  Some SGML
   expertise is much appreciated! ]

Yes, this sounds great!  I'm very interested in generating different pages
based on the IP address and/or domain name of the incoming client.  Clearly
the C preprocessor is not going to work very well, so it struck me that
some SGML construct is the best way to do it. For example:

  \<conditional tag=edu>This is for all those universities and colleges
  \</conditional>
  \<conditional tag=com>Businesses will be interested in this\</conditional>

Somehow the tags edu and com would have to be defined or not defined.  I'm
not really sure how to do this -- please pardon my SGML (and feel free to
point me to more information; I'm almost ready to buy the Goldfarb book).

I'm not sure, though, how this will avoid CGI.  One still has to generate
the tags and one is likely to want access to all the environment variables
that CGI provides to generate these tags, no?

Laurence Lundblade
  lgl@nwnet.net (summer)              lgl@csgrad.cs.vt.edu        703-552-2537
     NorthWestNet, Seattle (summer)      Virginia Tech -- Blacksburg, Virginia


[Paul Chamberlain] (comp.infosystems.www)

|   Corne van Delft \<code66@earl.hacktic.nl> wrote:
|   >I wonder if there is a possibility to have variable anchors in a
|   >HTML document and to set/change these variables in a HTML-document.
|
|   I've been thinking about an idea that solves this and a variety of
|   other "conditional text" type problems.  Let me think out loud here...
|
|   What if I used a CGI-bin script to retrieve ALL of my documents
|   (at least in the document tree of interest)?  You could use a URL
|   something like "http://server.domain/cgi-bin/getdoc/path/to/doc.html".
|   Now imagine that the CGI-bin preprocessed this document in some way.
|   You write a document that says something like:
|
|   	The \<a href=index.html?ARGS>Table of Contents\</a> includes
|   	#if (ARGS = TEACHERS)
|   	a suggested teaching plan.
|   	#endif
|   	#if (ARGS = STUDENTS)
|   	a list of homework assignments.
|   	#endif
|
|   Let's say we used "http://server/cgi-bin/getdoc/class1/intro.html?STUDENTS"
|   to originally refer to this document.  The "getdoc" script could
|   parse the EXTRA_PATH to find the document and preprocess it with
|   a "-DARGS=STUDENTS" option.  All the documents would then be written
|   with these preprocessing directives where necessary.  Some notes:
|
|     1.	The C preprocessor isn't designed to work with free text.
|     2.	I believe relative paths will still work using this.
|     3.	Obviously, other parameter passing options are possible.
|
|   Hmm, if document preprocessing-processing were built into the HTTPD:
|
|     1.	We wouldn't have to use ugly cgi-bin paths to do this
|     2.	Includes could be done outside of the daemon
|     3.	Last-time-modified hacks would not be in the daemon
|     4.	Conditional text would be supported (outside the daemon)
|     5.	Entire headers and footers could be generated automatically
|   	(signature, copyrights, feedback-link, homepage-link, etc.)
|
|   This seems like an easy addition to the .htaccess file.  A
|   preprocessor could be specified at the directory level.
|   Obviously a "sample preprocessor" should be provided which
|   has a few of the most popular functions.
|
|   Have I lost it or is this a great idea?!
|   --
|   Paul Chamberlain  |  \<I do NOT speak for IBM!>	| UNIX: tif@austin.ibm.com
|   IBM AWS, I/O Dev  |  512/838-3759, IBM T/L 678	| IBM-VNET:  TIF AT AUSTIN
</message>
<message id="<2sgd0i$ot3@crl.crl.com>" date="2979411410">
Newsgroups: comp.text.sgml
Date: 31 May 1994 22:16:50 UT
From: Joe English \<jenglish@crl.com>
Organization: Helpless people on subway trains
Message-ID: <2sgd0i$ot3@crl.crl.com>
Subject: LINK process chains

The SGML handbook talks about "chains of link processes", but I can't
figure out how they're specified.

For example, given document types D1, D2, and D3, where D1 is the source
document type, am I correct in assuming that the link process declarations:

    \<!LINKTYPE L1.2 D1 D2 ... >
    \<!LINKTYPE L2.3 D2 D3 ... >

specify that the source document instance is processed by 'L1.2' to produce
an instance of type 'D2', which is in turn processed by 'L2.3' to produce a
'D3' instance?  Or is this "application-defined"?

To complicate things, what happens if you add

    \<!DOCTYPE D2A ... >
    \<!LINKTYPE L1.2A D1 D2A ... >
    \<!LINKTYPE L2A.3 D2A D3 ... >

Is this illegal?  Implementation-defined?  Or does it specify that *two* D3
document instances are produced, one by the chain D1--L1.2-->D2--L2.3-->D3
and one by the chain D1--L1.2A-->D2A--L2A.3-->D3 ?

If the LPDs specify two D3 instances, and you then add:

    \<!DOCTYPE D4 ...>
    \<!LINKTYPE L3.4 D3 D4 ...>

how many D4 instances should be produced, and by what chains?

If (as I suspect) all of this is application-defined, would it be
reasonable for an application to enforce the restriction that "no document
type may be specified as the result document type in more than one explicit
link type declaration?"

I can't find any such restriction in the standard, but it seems that it
would clarify things some.

One final question: has anyone implemented explicit link processes?

--Joe English

  joe@trystero.art.com
</message>
