From: nikb@aladdin.co.uk (Nick barron)
Newsgroups: comp.text.sgml
Subject: Re: Recommendations wanted: editors for post-processing automated conversions
Date: Mon, 12 Feb 1996 07:34:22 GMT
Organization: SoNet - The first Internet provider on the south coast
Message-ID: <4fluio$c8t@news.aladdin.co.uk>
References: <4e3sfi$fsu@news.aladdin.co.uk> <310F87ED.1A19@spirit.com.au> <3114077B.22F7@passage.com>
"W. Eliot Kimber" \ wrote:
>I would agree that Author/Editor is well suited to this task (I depend on it).
Well, I was pushed to make a snap decision, and went for Author-Editor
-- looks like it`ll do the job nicely.
Many thanks for the advice.
From: asengupt@indiana.edu (Arijit Sengupta)
Newsgroups: comp.text.sgml
Subject: Re: nsgmls 1.0 and standard input
Date: 12 Feb 1996 00:53:24 GMT
Organization: Indiana University, Bloomington
Message-ID: <4fm324$kq7@usenet.ucs.indiana.edu>
References: \
Steve Heaney (heaney@cambridge.scr.slb.com) wrote:
: I've recently compiled and built the 1.0 version of nsgmls for Solaris 2.4
: and it works
: fine apart from one problem. When reading from standard input it
: consistently dumps
: core.
:
Use version 1.01 or get the patch from ftp.jclark.com or www.jclark.com
This was a known problem in 1.0. 1.01 fixes this.
Jit.
From: connolly@w3.org (Dan Connolly)
Newsgroups: comp.text.sgml,alt.hypertext
Subject: Re: Strongly-Typed Hyperlinks
Date: 12 Feb 1996 01:27:46 -0500
Organization: W3C
Sender: connolly@beach.w3.org
Message-ID: \
References: <31164A18.25C4@passage.com> <4f6onv$t18@Venus.mcs.com>
In-reply-to: jorn@MCS.COM's message of 5 Feb 1996 23:25:19 -0600
In article <4f6onv$t18@Venus.mcs.com> jorn@MCS.COM (Jorn Barger) writes:
> To make claims about the superiority of a hypertext
> design strategy without simulating it as far as possible
> on the WWWeb is ***scientifically irresponsible***
You don't even have to simulate it. Typed links have been a standard
architecture feature of the web since its inception. Popular
user agents just don't seem to exploit the feature.
A search for "typed link" in the www-talk archives:
http://www.eit.com/cgi-bin/wwwwais?keywords=typed+link\&selection=WWW-Talk%2C+1991-3\&maxhits=40
yields such goodies as:
http://www.eit.com/goodies/lists/www.lists/www-talk.1993q2/0282.html
HTML link "Made" relationship
Tim Berners-Lee (timbl@www3.cern.ch)
Tue, 11 May 93 16:46:44 +0100
>
> Can you cite *any* existing WWWeb site
> that would be more useful with typed links???
I experimented with typed links in producing the HTML renidition of the HTML 2.0 spec.
In that spec, evern glossed term is linked. It sure would be nice to have user
agents that could deemphasize those gloss links. Here's an example of the markup:
http://www.w3.org/pub/WWW/MarkUp/html-spec/html-spec_5.html#SEC5.7.3
\\SRC\
\
specifies the \URI\ of the image resource.
\(23)\
> And you allege that this will *add value*... but let's see some
> evidence.
Unfortunately, all I really have right now is an intuition that typed links
provide for richer knowledge capture and exchange, which is a Good Thing.
Dan
--
Daniel W. Connolly "We believe in the interconnectedness of all things"
Research Scientist, MIT/W3C PGP: EDF8 A8E4 F3BB 0F3C FD1B 7BE0 716C FF21
\ http://www.w3.org/pub/WWW/People/Connolly/
From: jon seymour \
Newsgroups: comp.text.sgml
Subject: SP 1.0.1 + Linux 1.2.13 + gcc 2.7.2 - segmentation fault
Date: Mon, 12 Feb 1996 20:56:20 +1100
Organization: The Warp Pharmacy
Message-ID: <311F0EC4.22D8EE63@zeta.org.au>
CC: jjc@jclark.com
Anyone have any experience with this combination?
nsgmls suffers a segmentation fault on all the tests that
run as a result of make check.
This is what I have installed:
linux-1.2.13 (ELF)
ld.so-1.7.14
gcc-2.7.2
libg++-2.7.1.3
libc-5.2.18
I did notice that the call stack gets corrupted by execution
of the declaration:
ParserToolkit kit;
in the first few lines of main(). Making kit a global fixed
the call stack corruption but did not stop nsgmls trapping.
Any ideas?
jon.
From: connolly@w3.org (Dan Connolly)
Newsgroups: comp.text.sgml
Subject: Re: Strongly-Typed Hyperlinks
Date: 12 Feb 1996 01:45:28 -0500
Organization: W3C
Sender: connolly@beach.w3.org
Message-ID: \
References: <4fdq2k$2vh@murphy.servtech.com>
In-reply-to: "Steven R. Newcomb"'s message of Thu, 8 Feb 1996 16:26:20 -0500
In article <4fdq2k$2vh@murphy.servtech.com> "Steven R. Newcomb" \ writes:
>
> Thanks for your reasonable, back-to-the-literature words. I think
> Randy Trigg's dissertation has not been as industrially appreciated as
> it should be. It was a prominent document in the HyTime committee's
> document register, and I still refer to it when I want to look at a
> list of garden-variety link types. The sheer open-endedness that one
> must confront when making such a list would, I think, always force an
> admission that there can be no "once-and-for-all" list of semantics to
> be expressed by linking.
It seems that there's some great experience in this area represented
by the readership of this froum.
A range of issues from standardized link types to compound document
architectures are being discussed in the HTML working group of the
IETF. There are two documents that appear to be on the standards
track, which means that the world may have to live with the decisions
that get recorded in those documents.
Please review (and send comments to html-wg@w3.org, after reviwing the
group charter etc. at: http://www.w3.org/pub/WWW/MarkUp/HTML-WG/):
( excerpted from http://www.ietf.cnri.reston.va.us/ids.by.wg/html.html
which, ironically, has a markup error that prevents three of the five
abstracts from being presented in my user agent :-)
ftp://ietf.cnri.reston.va.us/internet-drafts/draft-ietf-html-relrev-00.txt
"Hypertext links in HTML", M. Maloney, L. Quin, 01/15/1996. (64553 bytes)
The hypertext link mechanism is the connective tissue used to weave the
World Wide Web. A hypertext link is an object which specifies a
connection between any arbitrary addressable objects, locations, or
resources.
ftp://ietf.cnri.reston.va.us/internet-drafts/draft-ietf-html-cda-00.txt
"Compound Documents in HTML", P. Burchard, D. Raggett, 11/22/1995. (43493
bytes)
This specification provides an HTML implementation of a simple compound
...
Dan
--
Daniel W. Connolly "We believe in the interconnectedness of all things"
Research Scientist, MIT/W3C PGP: EDF8 A8E4 F3BB 0F3C FD1B 7BE0 716C FF21
\ http://www.w3.org/pub/WWW/People/Connolly/
From: mskuhn@unrza3.dialin.rrze.uni-erlangen.de (Markus Kuhn)
Newsgroups: comp.text.sgml
Subject: Re: ISO Public Text
Date: 12 Feb 1996 03:15:59 +0100
Organization: Markus Kuhn, 91080 Uttenreuth, Germany
Message-ID: <4fm7sv$m1@cortex.dialin.rrze.uni-erlangen.de>
References: <4flhov$ep5@reader2.ix.netcom.com>
Reply-To: mskuhn@cip.informatik.uni-erlangen.de
Carolyn Gordon \ writes:
>Where can I find ISO Public text entity declarations? Sorry
>for the dumb question, but I could not find a FAQ. Thanks in
>advance!
Erik Naggum's archive \ is a good place
to look for such files.
Markus
--
Markus Kuhn, Computer Science student -- University of Erlangen,
Internet Mail: \ - Germany
WWW Home: \
Newsgroups: comp.text.sgml,alt.hypertext
From: ht@cogsci.ed.ac.uk (Henry S. Thompson)
Subject: Re: SGML without DTDs?
In-Reply-To: Peter Murray-Rust's message of Sun, 11 Feb 96 10:18:27 GMT
Message-ID: \
Reply-To: ht@cogsci.ed.ac.uk
Organization: Centre for Cognitive Science, Edinburgh, UK
References: <9601300135.AA15133@fly.hiwaay.net> <4ek508$igd@Venus.mcs.com> <9601311617.AA22637@fly.hiwaay.net> <4eojl1$38s@Mercury.mcs.com> <9602010548.AA27472@fly.HiWAAY.net> \ <19960204T035437Z@arcana.naggum.no> <824033907snz@ursus.demon.co.uk>
Date: Mon, 12 Feb 1996 10:00:29 GMT
We have implemented and are making regular use of a system which is
very close to what you describe, which passes normalised SGML between
tools. See http://www.cogsci.ed.ac.uk/~ht/nsldoc.ps for an
introduction.
HOWEVER, we do use SP 'at the margins' of our architecture, and don't
expect people to produce normalised SGML by hand. Parsing
normalised SGML is fast and easy because we assume it's valid -- if it
isn't, we fall over hard. I'm skeptical about the possibilities of
constructing a robust parser for your normalised SGML which isn't
almost as complex as SP.
ht
--
Henry S. Thompson, Human Communication Research Centre, University of Edinburgh
2 Buccleuch Place, Edinburgh EH8 9LW, SCOTLAND -- (44) 131 650-4440
Fax: (44) 131 650-4587, e-mail: ht@cogsci.ed.ac.uk
URL: http://www.cogsci.ed.ac.uk/~ht/
From: bernd@NeRo.Uni-Bonn.DE (Bernd Kreimeier)
Newsgroups: comp.text.sgml
Subject: Re: DTD for *incorrect* HTML
Date: 12 Feb 1996 13:45:24 GMT
Organization: University of Bonn, Dept. of Comp. Sc. VI, Germany
Message-ID: \
References: \ <311E2EAB.14CC@passage.com>
In-reply-to: "W. Eliot Kimber"'s message of Sun, 11 Feb 1996 10:00:11 -0800
In article <311E2EAB.14CC@passage.com> "W. Eliot Kimber" \ writes:
> Joe Armstrong wrote:
> >
> > Is there a DTD for flawed HTML.
>
> \ \
> ]>
The w3c SGML declaration for HTML (3.0) says SUBDOC NO. Don't you
have to change this for your flawed_html.dtd as well as the w3c
HTML.DTD variations (strict, recommended)? To my understanding,
SUBDOC NO reflects the limitations of current HTML web browsers.
Perfectly complying, both Mosaic and current Netscape display minor
garbage at the document's start.
B.
From: bernd@NeRo.Uni-Bonn.DE (Bernd Kreimeier)
Newsgroups: comp.text.sgml
Subject: Re: SP 1.0.1 + Linux 1.2.13 + gcc 2.7.2 - segmentation fault
Date: 12 Feb 1996 13:49:58 GMT
Organization: University of Bonn, Dept. of Comp. Sc. VI, Germany
Message-ID: \
References: <311F0EC4.22D8EE63@zeta.org.au>
In-reply-to: jon seymour's message of Mon, 12 Feb 1996 20:56:20 +1100
This might be the same problem I ran into. Make sure to remove
any obsolete libiostream refs from the Makefile, and include
-lstdc++ instead. This fixed core's on
"nsgmls \" and all other problems
for me.
If there's a different problem, please let me know. I'd like
to put in the the "Known trouble" section of my
"SGML for Pedestrians" - if I ever find the time to write
the first release.
B.
From: milor001@maroon.tc.umn.edu (R A Milowski)
Newsgroups: comp.text.sgml
Subject: Re: nsgmls 1.0 and standard input
Date: 12 Feb 1996 07:57:21 -0600
Organization: University of Minnesota
Message-ID: <4fnh01$8sq@maroon.tc.umn.edu>
References: \
In article \,
Steve Heaney \ wrote:
>I've recently compiled and built the 1.0 version of nsgmls for Solaris 2.4
>and it works
>fine apart from one problem. When reading from standard input it
>consistently dumps
>core.
Yes, this is a bug. I believe there is a 1.1 version that fixes this.
--
==============================================================================
R. Alexander Milowski milor001@maroon.tc.umn.edu
Copernican Solutions Incorporated (612) 825 - 4132
From: mcclellantj@harrier (Tad McClellan)
Newsgroups: comp.text.sgml
Subject: Re: What are the limits of marked section declarations?
Date: 12 Feb 1996 14:19:33 GMT
Organization: Lockheed Martin Tactical Aircraft Systems
Message-ID: <4fni9l$imh@cliffy.lfwc.lockheed.com>
References: <4fg9pb$11h@news.cerf.net>
Reply-To: mcclellantj@lfwc.lockheed.com
Whatis (whatis@yyz.com) wrote:
: I'm currently converting a help system with a proprietary source
: into SGML. Most of it has gone pretty smoothly, but I'm up
: against a problem I can't solve.
: The previous help system allowed the help to be tuned to specific
: products and families of products, using boolean expressions at
: about the same level as the C pre-processor. For instance,
We, in the DoD IETM arena, refer to this as 'context filtering'. We
use many different contexts such as: aircraft effectivity, technician
skill level, whether the last test passed or failed, etc...
: @X||Y&&(!Z)
: Some text
: @ELSE
: Some other text
: @END
: The facility I see in SGML for doing stuff like this is the marked
: setion declaration, combined with parameter entities. That way, I
: can set up declarations like
: What do I do in SGML? Or am I SOL?
As another followup pointed out, you're maybe just FOL (Fart OL),
not quite as serious as SOL...
The Content Data Model (CDM) DTD (MIL-D-87269) puts the burden of
evaluating the expressions on the SGML application (Presentation
system, in our case), rather than on the SGML parser.
These are primarily implemented with the \, \,
\ and \ elements.
e.g.
\
\
\
\
\
\
\
\NUM_SEATS\
\
\1\
\
\
\
The Aircraft has one control stick.
\
\
\
\
\
\NUM_SEATS\
\
\2\
\
\
\
The Aircraft has two control sticks.
\
\
\
\
\
\
\ is a variable name that the SGML application has set
previously (as initialization, or by another IETM document).
The SGML app evaluates the \ and if TRUE presents the \
else skips \
Kinda ugly, but workable...
--
Tad McClellan, Logistics Specialist (IETMs and SGML guy)
email: mcclellantj@lfwc.lockheed.com
The "Battle of the Sexes" is perpetuated by fraternizing with the enemy.
From: stuart@telerama.lm.com (R. Edward Stuart)
Newsgroups: comp.text.sgml
Subject: Re: Musing on Networked Hypertext
Date: Mon, 12 Feb 1996 03:49:40 GMT
Organization: Telerama Public Access Internet, Pittsburgh, PA
Message-ID: <4fnk19$66@scramble.lm.com>
References: <31014D2A.7BF2@passage.com> <9601220123.AA21903@fly.HiWAAY.net>
>"The word is not the thing" -- S.I. Hayakawa.
So who the fuck is "Hayakawa?"
Thanx,
Ed Stuart
stuart@telerama.lm.com
From: stuart@telerama.lm.com (R. Edward Stuart)
Newsgroups: comp.text.sgml
Subject: Re: What's New in the SGML World?
Date: Mon, 12 Feb 1996 03:49:48 GMT
Organization: Telerama Public Access Internet, Pittsburgh, PA
Message-ID: <4fnk1h$66@scramble.lm.com>
References: <5SviKAAP8kBxEwzL@kglps.demon.co.uk> <4ejgdb$43m@franklin.its.utas.edu.au>
3. What publications would you recommend?
>"Practical SGML"
Thanx,
Ed Stuart
stuart@telerama.lm.com
From: mcclellantj@harrier (Tad McClellan)
Newsgroups: alt.hypertext,comp.text.sgml
Subject: Re: REQ: FAQ
Followup-To: alt.hypertext,comp.text.sgml
Date: 12 Feb 1996 14:44:10 GMT
Organization: Lockheed Martin Tactical Aircraft Systems
Message-ID: <4fnjnq$46l@cliffy.lfwc.lockheed.com>
References: <311D39E7.590F@hobbe.adb.gu.se>
Reply-To: mcclellantj@lfwc.lockheed.com
Jonas Liljegren (a95jonas@hobbe.adb.gu.se) wrote:
: Is there a FAQ for this newsgroup?
Not an official one. There is a wealth of info/links at:
http://www.sil.org
--
Tad McClellan, Logistics Specialist (IETMs and SGML guy)
Birthdays are good for you ...
A federally funded study has shown that people with the
most birthdays live the longest.
From: mcclellantj@harrier (Tad McClellan)
Newsgroups: comp.text.sgml,alt.hypertext
Subject: Re: SGML without DTDs?
Followup-To: comp.text.sgml,alt.hypertext
Date: 12 Feb 1996 15:34:00 GMT
Organization: Lockheed Martin Tactical Aircraft Systems
Message-ID: <4fnml8$46l@cliffy.lfwc.lockheed.com>
References: <9601300135.AA15133@fly.hiwaay.net> <4ek508$igd@Venus.mcs.com> <9601311617.AA22637@fly.hiwaay.net> <4eojl1$38s@Mercury.mcs.com> <9602010548.AA27472@fly.HiWAAY.net> \ <19960204T035437Z@arcana.naggum.no> <824033907snz@ursus.demon.co.uk>
Reply-To: mcclellantj@lfwc.lockheed.com
Peter Murray-Rust (Peter@ursus.demon.co.uk) wrote:
: I am developing SGML applications for the molecular sciences (a set of
: DTDs I shall generically call Chemical Markup Language (CML)). I have come
: to a stage where I wish to have the system implemented as a means for
: transferring molecular data between many different applications on many
: different platforms. I can assume that my community is computer-literate,
: understands the need for absolute precision in constructing documents,
: is (for the most part) well disposed to the implementation of standards,
: and is _almost totally_ ignorant of SGML. (It is however, thoroughly
: acquainted with HTML and will understand the general concept of hierarchical
: data structures with balanced delimiters).
: PROBLEM:
: I have a major problem in that to distribute this requires a great deal of
: complex and error-prone SGML technology which I do not think is necessary
: for my application. (The errors come not from SGML itself, but its use by
: a community who won't know the first thing about it :-).
: QUESTION:
: Is it possible to agree a 'subset' of SGML that will allow this community
: (and I suspect many others) to use the SGML approach with simpler technology?
: PROPOSAL:
: I would like to agree on a subset of SGML that was parsable into an ESIS
: stream without the use of DTDs. This subset would still be valid SGML and
: would parse against its DTD(s) using SGML. It relies on documents being
: valid normalised SGML without default attributes. It also assumes that there
: is a relatively limited use of entities and, I suspect, no marked sections.
: In making this proposal I am probably repeating existing suggestions and I'm
: also likely to have overlooked aspects of SGML that may cause problems, so
: please forgive these in advance.
: ELEMENTs:
: All elements should have explicit end-tags, though these could be abbreviated
: to > withoutproblems.
Or even abbreviated further if pre-processed with a normalizer such
as James Clark's 'spam'.
: However, I have a problem with EMPTY content models.
^^^^^^^^^^^^^^^^^^^^
\
Technically, EMPTY is 'declared content', not a 'content model'. See
ISO-8879 production [116] and [125]
\
: sgmls gives an error for including an end-tag for these (I assume this
: requirement is in the standard). If a parser does not have a DTD it has to
: look ahead to the end of the document before it can be sure that there is
: no closing tag for an element.
You have a similiar problem (not known if balance required/disallowed)
with #CONREF attributes.
: ATTLIST
: The primary use of ATTLIST that I make is to (a) validate that the
: attribute name is correct, and (b) to supply default values. If these
: are guaranteed by the creating software there is no need for ATTLIST.
: (SGML has weak type-checking so the loss of that is unimportant).
Another use of ATTLIST is to identify #CONREF attributes, as above.
Moot, if your DTD uses no CONREFs.
: Conclusion:
: -----------
: This proposal might be though to have removed most of the valuable bits of
: SGML, but what is left is extremely valuable for our community. Until
: I started using SGML and CoST I was unaware of how valuable it was to have
: structured hierarchical documents. With the use of CoST/DSSSL I can now
: ask much more powerful queries of primary data that I could with (say)
: the RDB model (mainly because it is such an effort to build RDBs). Moreover,
: SGML solves the problem of separating syntax from semantics which is still
: unresolved in many other systems.
--
Tad McClellan, Logistics Specialist (IETMs and SGML guy)
email: mcclellantj@lfwc.lockheed.com
The "Battle of the Sexes" is perpetuated by fraternizing with the enemy.
Newsgroups: comp.text.sgml
Date: Mon, 12 Feb 1996 11:24:36 -0600
Message-ID: <9602121724.AA22318@fly.HiWAAY.net>
From: Len Bullard \
References: <4fdq2k$2vh@murphy.servtech.com> \
Subject: Re: Strongly-Typed Hyperlinks
[Dan Conolly]
>A range of issues from standardized link types to compound document
>architectures are being discussed in the HTML working group of the
>IETF. There are two documents that appear to be on the standards track,
>which means that the world may have to live with the decisions that get
>recorded in those documents.
Thanks eversomuch for the ftp addresses to the cited documents. We can
look at these and compare them to requirements for HyTime-conforming
systems. However, just one quibble: the WWW, not the world or even the
Internet, lives with the decisions of the HTML working group. These are
separable domains.
Now ask if it is the current biggest game in town... ;-)
Len Bullard
From: jorn@MCS.COM (Jorn Barger)
Newsgroups: comp.text.sgml,alt.hypertext
Subject: Re: Strongly-Typed Hyperlinks
Date: 12 Feb 1996 10:36:21 -0600
Organization: The Responsible Party
Message-ID: <4fnqa5$p8t@Venus.mcs.com>
References: <31164A18.25C4@passage.com> <4f6onv$t18@Venus.mcs.com> \
In article \,
Dan Connolly \ wrote, quoting me:
> > To make claims about the superiority of a hypertext
> > design strategy without simulating it as far as possible
> > on the WWWeb is ***scientifically irresponsible***
>You don't even have to simulate it. Typed links have been a standard
>architecture feature of the web since its inception. Popular
>user agents just don't seem to exploit the feature.
That's reason enough to simulate, isn't it?
[...]
>I experimented with typed links in producing the HTML
rendition of the HTML 2.0 spec. >In that spec, every glossed term is
linked. It sure would be nice to have user >agents that could
deemphasize those gloss links. [...]
>http://www.w3.org/pub/WWW/MarkUp/html-spec/html-spec_5.html#SEC5.7.3
Interesting... let me think out loud about this:
So, browsers could have a 'glossary mode' that reveals which terms are
defined... or you could configure a non-obtrusive style for them, like
dark-blue text...
It sounds like a lot of markup, unless you can assign the HREF in the
header... And even then, for a word that's used 1000 times on a
page, it sounds like a pretty inefficient approach. How about an
embedded glossary-object that tells which words are glossed (and
where)?
You can learn a lot about the usefulness of this by simulating it with
regular links as you've done (except they're a lot more obtrusive) or
with an asterisk or something next to defined words, and the glossary
at the bottom of the page...
My sense, though, is that this isn't really a problem you want to try
to 'mechanize'...
I think there's a general danger in hypertext theory of trying to take
the burden off the writer, by creating 'style-templates' you can just
*fill in*.
But this *encourages* atrocious writing habits! Writers are supposed
to think carefully about who's going to be reading their work, and
fine-tune it to meet their needs.
How many people are even going to bother to create the glossary links?
Wouldn't it work better to just have a convention that all documents
should include a link labelled "If you don't understand the
vocabulary, try this first"?
What other sorts of semantic links are being proposed?
[...]
> > And you allege that this will *add value*... but let's see some
> > evidence.
>Unfortunately, all I really have right now is an intuition that typed links
>provide for richer knowledge capture and exchange, which is a Good Thing.
Well, I appreciate your honesty!
(So, as far as you know, the people who claimed that the WWWeb was
fatally flawed for lack of them, and beneath the interest of the
experts, were talking Bull? ;^/
j
-==---
. hypertext theory : artificial intelligence : finnegans wake . _+m"m+_"+_
lynx http://www.mcs.net/~jorn/ ! Jp Jp qh qh
best-of news:alt.music.category-freak ! O O O O
ftp://ftp.mcs.com/mcsnet.users/jorn/ Yb Yb dY dY
...do you ever feel your mind has started to erode? "Y_ "Y5m2Y" " no.
Newsgroups: comp.text.sgml,alt.hypertext
From: jbottoms@world.std.com (John W Bottoms)
Subject: Re: SGML without DTDs?
Message-ID: \
Organization: The World Public Access UNIX, Brookline, MA
References: <9601300135.AA15133@fly.hiwaay.net> \ <19960204T035437Z@arcana.naggum.no> <824033907snz@ursus.demon.co.uk>
Date: Mon, 12 Feb 1996 16:35:49 GMT
In article <824033907snz@ursus.demon.co.uk>,
Peter Murray-Rust \ wrote:
>I am developing SGML applications for the molecular sciences (a set of
>DTDs I shall generically call Chemical Markup Language (CML)). I have come
>PROBLEM:
>The major problem I have is that to run SGML properly every site has to
>implement:
> a parser
> DTDs for every document type that I use
> a catalog (to keep the whole thing together)
> a set of *.ent files (e.g. for ISOlat1, etc)
>QUESTION:
>Is it possible to agree a 'subset' of SGML that will allow this community
>(and I suspect many others) to use the SGML approach with simpler technology?
>PROPOSAL:
>I would like to agree on a subset of SGML that was parsable into an ESIS
>stream without the use of DTDs. This subset would still be valid SGML and
>would parse against its DTD(s) using SGML. It relies on documents being
>valid normalised SGML without default attributes. It also assumes that there
>This approach allows 'unknown' GIs, and it may be claimed that this
>completely breaks the spirit of SGML. Since the files would have to parse
>Peter Murray-Rust, GlaxoWellcome, domestic net connexion
(snipped heavily)
It is not clear whether you are talking about using a DTD or not. But
anyway...
1. You have a mixed model here. SGML is parsed. The instance can be
translated into another data stream. I'm not familiar with ESIS but
I'll assume that there is a mapping from some instance (against some
DTD) that can be mapped into an ESIS form.
2. All SGML instances require a DTD. So what you are proposing is a
modification to SGML or another metagrammar...not SGML. I think
what you are really talking about is an application that works
with a non-SGML markup that will do what you need. We could refer
to the path HTML has taken for this one. Maybe the real question
should be "Why bother with SGML at all?". It may be more baggage
than you need. And I think you have a DTD in mind but it may be
buried in your proposed application.
Finally, it sounds as though you would like to be able to use tags
which can be anything that is of a legal format and length. SGML
does not allow regular grammars for the expression of elements so
this would not be possible in SGML. Again, the driving force here is
to look at an alternate metagrammar or explain why you would like to
use SGML. Some more detail about the implemented system would be
helpful.
-jb SGML'er
From: chip@ils.nwu.edu (Chip Cleary)
Newsgroups: comp.text.sgml,alt.hypertext
Subject: Re: Strongly-Typed Hyperlinks
Date: Mon, 12 Feb 1996 11:32:57 -0600
Organization: The Institute for the Learning Sciences
Message-ID: \
References: <31164A18.25C4@passage.com> <4f6onv$t18@Venus.mcs.com> \ \ \
In article \,
Bernstein@eastgate.com (Mark Bernstein) wrote:
> SEPIA: Streitz, Thuering, Haake, Haake, et al. created an important
> collaborative hypertext system which used "activity spaces" to organize
> concurrent collaboration, and relied on link types to clarify to everyone
> (and to the system) the purpose and role of each hypertextual component.
> See especially "What's Eliza Doing in the Chinese Room" in Hypertext 89
> (or 91?)
For an example of a WWW system that uses link types in a similar way (to
clarify where the user can look for different types of information), see:
http://www.ils.nwu.edu/~e_for_e
This system uses link types to collect specific follow-up questions about
a page under general question categories. This organization is intended
to:
- allow users who want to ask specific questions to quickly locate them, and
- to provide a set of general types of questions that less directed
users can scan to see which, if any, they want to ask (and then to
show under these general questions the specific instantiations of them
that the system can actually answer).
This system is a quickie WWW implementation of a modified type of ASK
system [Bareiss & Osgood, Hypertext-93].
In article \,
Bernstein@eastgate.com (Mark Bernstein) wrote:
> Not within the constraints of http. The web is carefully engineered to
> avoid the need to keep any state information between transactions. This
> means that web servers need not maintain account information or session
> records for each user, which makes web servers *much* faster and more
> efficient than, conventional session-based designs.
Although web sites *need not* maintain state information, I think they
*can* if you want them to. For example, ACM's "living catalog" tracks
users by generating web pages that have a user id added to every lin. It
uses this capability to track one type of state -- letting users carry
around a "basket" of things they've ordered as they browse.
http://www.acm.org/catalog/
A site that wanted to customize pages for individuals based on what other
pages they've already seen could use a similar approach (as long as the
user remains within the confines of the site or logs back in). I've wanted
to add this type of capability to e_for_e for some time to better suggest
how individual users may move "forward" through the system, though I've
never found the time. Right now, each page just has a hard-coded "next"
page.
- Chip
Newsgroups: comp.text.sgml,alt.hypertext
From: mlt@netcom.com (Marcy Thompson)
Subject: Re: Strongly-Typed Hyperlinks
Message-ID: \
Organization: Netcom Online Communications Services (408-241-9760 login: guest)
References: <31164A18.25C4@passage.com> <4f6onv$t18@Venus.mcs.com> \
Date: Mon, 12 Feb 1996 18:46:38 GMT
Sender: mlt@netcom8.netcom.com
Jorn Barger writes:
> To make claims about the superiority of a hypertext
> design strategy without simulating it as far as possible
> on the WWWeb is ***scientifically irresponsible***
Is this like saying: "To make claims about the usefulness of some
new user interface feature for word processors without simulating
it as far as possible using Word macros is scientifically irresponsible"?
What if the new user interface feature were incompatible with Word as
it is currently available? What if I knew and could prove that any
test on Word would be so seriously distorted as to be meaningless or
worse?
Note that I have said nothing about whether I think that the WWW
is a useful test bed for a particular theory of hypertext design. I
think it *is* a useful test bed for some of them. But I do not believe
that it is a uniformly useful test bed for all theories of hypertext
design, and *I* think that claims that it is are irresponsible in
the extreme.
Test beds vary in their effectiveness. To know whether the WWW is
a useful test bed for a particular theory, you have to know things like
whether the theory is compatible with the design of the WWW, whether
the theory is testable on the WWW, what aspect of the theory you need
to test, and so on.
Marcy (who is *not* Eliot, okay, so would people please stop
sending me mail about it?)
--
Marcy Thompson
work: marcy@passage.com
play: marcy@squirrel.com mlt@netcom.com
From: Peter Murray-Rust \
Newsgroups: comp.text.sgml
Subject: Re: SGML without DTDs?
Date: Mon, 12 Feb 96 18:56:54 GMT
Organization: Myorganisation
Message-ID: <824151414snz@ursus.demon.co.uk>
References: <9601300135.AA15133@fly.hiwaay.net> \ <19960204T035437Z@arcana.naggum.no> <824033907snz@ursus.demon.co.uk> \
Reply-To: Peter@ursus.demon.co.uk
[First an apology; my mailer included alt.hypertext in this thread so
please remove it in any replies]
Thank you for your replies so far. I have had a chance to peruse
NSGML (Henry Thompson) which is very much along the lines of what I
wish, although I think I can make my requirements simpler. I reiterate
that all documents are valid SGML and parse using SGMLS (I am not
inventing another language). What I wish to do is:
define a valid _restricted subset_ of SGML, used to create DTDs.
create document instances which parse against these DTDs and
produce an ESIS stream (e.g. using SGMLS)
write alternative (simple) software that has no knowledge
of the DTD, but produces the same ESIS stream.
To reformulate the question with an example of a document instance:
\
\
]>
\
\xyzzy\
\plugh\
\
This parses with sgmls.1.1.91 and produces the ESIS stream:
(FOO
(BAR
-xyzzy
)BAR
(BAR
-plugh
)BAR
)FOO
which I can read into CoST and translate/render/search, etc.
Note that (so far) the \...\ cannot be unambiguously translated
into an ESIS stream without the knowledge of the DTD. For example, a
DTD like:
\
\
\
will produce a difference ESIS stream (adding additional whitespace content
and the default attribute ATT="NO" for both BAR's).
However, if we forbid mixed content (#PCDATA|BAR), and default attributes,
this DTD is not valid (under our rules). The question is whether it is
possible to create a set of rules whereby the ESIS stream can always be
generated from the document instance.
(Note that the ESIS stream does not retain all the information in the
document instance, so the reverse process (ESIS->*.sgm) is not normally
possible. With a restricted set of DTDs it might be possible to recreate
useful *.sgm files).
NSGML defines 'semi-valid' SGML. The DOCTYPE is omitted, and there can
be multiple instances of subtrees (e.g. there could be several \...\
in an instance). This is not allowed in what I am proposing - there must
always be a valid DOCTYPE and the document must validate against SGMLS.
(The non-SGMLS translation software will ignore the DOCTYPE).
Bringing together the rules in NSGML, with the additional omission of
#CONREF (Tad McClellan) and some of my own, the following set of rules
is proposed as a starting point.
- Document is coded using an ISOlatin character set, with embedded
character entities where necessary
- Reference concrete syntax is used
- No capacity or length restrictions
- All NAMES, etc must be valid SGML
- No short refs (?or tag minimisation)
- No SUBDOCs
- No marked sections
- All end-tags present (except for EMPTY elements)
- All entity references terminated with ';'
- No entity references except of type SDATA (i.e. everything is in
a single file except agreed character sets)
- All NAMEs (start and end tag GIs, attribute names) have no lower
case letters
- No CDATA or RCDATA element content
- NO #CONREF
- No mixed content (#PCDATA|FOO)
- No default attribute values
- All attribute values to have CDATA type
- All potential SGML delimiters (e.g. '<') to be escaped (e.g. \<)
Although this restricts the possible things that could be done with SGML
I believe that DTDs (and document instances) constructed in this way would
be powerful enough to bring real benefits.
This is not meant to undermine the validation procedure. A typical way that
it would be used is for an international supplier of data to provide
fully validated document instances for distribution over the WWW. These
would be flagged as readable without SGMLS and it would be possible to create
simple software which created the ESIS stream. Parsing this (e.g. with CoST)
for rendering/translation/searching, etc. would then be identical to
postprocessing SGMLS output.
Peter.
--
Peter Murray-Rust, GlaxoWellcome, domestic net connexion
From: Bernstein@eastgate.com (Mark Bernstein)
Newsgroups: comp.text.sgml,alt.hypertext
Subject: Re: Strongly-Typed Hyperlinks
Date: Mon, 12 Feb 1996 14:11:18 -0500
Organization: Eastgate Systems, Inc.
Message-ID: \
References: <31164A18.25C4@passage.com> <4f6onv$t18@Venus.mcs.com> \ <4fnqa5$p8t@Venus.mcs.com>
Ok, folks, I've just about had it.
On the subject of strongly-typed links, Dan Connolly writes:
> >Unfortunately, all I really have right now is an intuition that typed links
> >provide for richer knowledge capture and exchange
Jorn Barger "paraphrases" this response as follows:
> So, as far as you know, the people who claimed that the WWWeb was
> fatally flawed for lack of them, and beneath the interest of the
> experts, were talking Bull?
First, Barger implies that some believe that WWW is "beneath the interests
of the experts". He says this often. It is false, silly, and deceptive,
but that seems no deterrent.
Disposing of the gratuitous clauses, Barger writes that
> So, as far as you know, the people who claimed that the WWWeb was
> fatally flawed for lack of [typed links]..... were talking Bull?
This is reprehensible nettiquette, as it purports to paraphrase Connolly's
position while actually contradicting it.
This is irresponsible argumentation. How does "talking bull" contribute to
the discussion?
This is mendacious, since it sets up a preposterous straw man (the
mythical hypertext expert with whom Barger seems utterly obsessed) to
distract people from the indefensible underlying claim: that Barger has
added anything to the extensive research record on typed links, or indeed
to what he terms "hypertext theory".
--
Mark Bernstein Bernstein@eastgate.com
Eastgate Systems, Inc voice: (800) 562-1638 +1 (617) 924-9044
134 Main Street fax: (617) 924-9051
Watertown MA 02172 USA http://www.eastgate.com/
Newsgroups: comp.text.sgml
Date: Mon, 12 Feb 1996 15:17:35 -0600
Message-ID: <9602122117.AA15266@fly.HiWAAY.net>
From: Len Bullard \
Subject: Re: Facts, folks. We need facts.
Cross-posted to comp-text-sgml for those in the SGML community who still
question why DTDs are needed. They allow communities to express and make
validatible agreements. At will...
The VRML community is now witnessing what cowboy standards achieve: they
round up the cows, pen them, then ship them off to the meat-packing plant.
[Jan H.]
>I have talked to someone who claims to have seen a press release from
>Netscape confirming every rumor ever whispered (except for the Snapple
>deal).
>
>If anyone has this, they should post it, or a URL. I'll keep looking.
Well, I have read the SGI posting on the subject.
This is process? They panicked, or as the old jazz standard says, "You
made your move too soon."
The sad thing is that Moving Worlds would probably have won honestly. Now,
there can be no honest decision. It was a grand try though. Some folks
have no faith in consensus, and they were risking too much investment. If
the 4 billion dollar bubble bursts, some programmers may have to go back to
U of ChiTown and get their jobs back. Poor old WWW. It makes a nice
hostage.
Apple and Microsoft should sue, but they won't. In a situation with no
legal entity, there is no recourse to due process. Understand that this is
why real standards bodies are provenanced. What M\&A can do is walk away
from a game rigged for them to lose. They can make a deal to combine their
proposals and field a new 3D metafile format for the Internet. The
advantages of such a deal are obvious. A company in trouble that coalesces
with its stronger rival and in that combination, beats their mutual
competitors would be doing smart business even if only for a few deals.
You don't have to merge to dance.
You do have to own the framework API.
The Internet is not the WWW. Some state that VRML is for the "Net".
Smarter folks can figure out that there are many applications for 3D
metafiles that don't need, and in fact are hindered by, network engines.
There are better models for hyperlinks, there are better protocols than
HTTP, and there are better applications than HTML and VRML. The W3C and
its 120 consortium members want you to believe that success comes from
consensus.
Success comes from winning market share.
Cooperation is a way to get that. So is competition. Competition has an
advantage: competitors evolve, adapt, and take and hold resources that they
deny to other competitors. It isn't necessary to eliminate a competitor,
just starve them until they become cooperative.
Tools. The secret to success is the tools.
Novices don't know that, but they understand ease. In hypermedia, the
overwhelming number of customers are novices. By the time they figure out
they need more, they have invested too much. Remember all those "cool
people" in the seventies and eighties who swore to you that cocaine wasn't
addictive?
Smart folks know otherwise.
The content providers could choose to walk away, but we probably won't. Of
course, if someone else provides a complete tool suite at a decent price
and a complete environment guaranteed for the next release of their
operating systems, we might be persuaded otherwise. We are a fickle lot.
Making the lips of the fish move can't be that hard, and I doubt that the
current amount of extant content is that large. I'd rather develop content
for a corporation with depth and breadth in the product line than for one
banking on a browser.
As for the gods, we are men. As for heroes, beware. There was a fellow
named Alcibiades who comes to mind just now.
I vote for ActiveVRML. Call your friends and ask them to do the same. At
least all the betas will come in one box.
Len Bullard
Newsgroups: comp.text.sgml
From: donald@sq.com (Donald Teed)
Subject: Re: Tanks: TEI & SQ's Author/Editor
Message-ID: <1996Feb12.210415.23133@sq.com>
Organization: SoftQuad Inc., Toronto, Canada
References: \ \
Date: Mon, 12 Feb 1996 21:04:15 GMT
ST.KURTZ@BIONIC.zerberus.de (Stefan Kurtz) writes:
>A big "thank you" to all the people (especially to the SoftQuad people and to Arjan
>Loeffen), who sent me mails with solutions and hints.
>Well, Author/Editor _works_ with TEI.
I'm glad to hear this came to be. Just so that others in the TEI
user community can learn about the problem and the solution, we thought
it would be good to share the news on this issue with others.
The TEI DTDs are a set of DTDs that use undefined elements as well
as marked section parameter entities in the document type declaration
subset. These usages are not presently supported by SoftQuad's RulesBuilder,
although we are working on supporting these. (Marked section
parameter entities that are applied on the document instance are supported,
but those which would have effect within the DTD are not supported.)
Thus the TEI DTDs will not compile (as is) with RulesBuilder.
The work-around is:
1) do not use parameter entities in the document type declaration subset
2) do not use elements without defining a content for them.
One solution is to use a preprocessor such as Greg Murphy's preparse,
a Perl script available from ftp://ceth.princeton.edu/tei/tools/preparse,
which expands all marked sections and external parameter entities, and cleans
the content models of all elements of any undefined elements.
The reason that the mkrls program does not appear to have a problem with
the undefined elements is that some versions of this program had a bug
with a command line option to suppress warnings about defined
and unused elements. The bug caused this same switch to
suppress errors about undefined elements. Some people in
the TEI community have reinterpreted this bug as a feature.
From our point of view, RulesBuilder 3.0 fixed the bug with
the "Report elements declared but not used" feature.
I'd like anyone having problems with supported SoftQuad software
to make use of our product support. We can be contacted at
support@sq.com
--Donald Teed, Technical Support, SoftQuad