[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: HTML HOWTOs and Re: CD edition



On May 5, 12:26am, David Lawyer wrote:
> Subject: Re: HTML HOWTOs and Re: CD edition
> > On Apr 30, 11:03pm, David Lawyer wrote:
> > > Subject: HTML HOWTOs and Re: CD edition
> > ...
> > Note that a gzipped tar file is also created which is a package
> > of all the single-file HTML docs.
> >
> > The question is "should the files themselves and/or the tar
> > packages be mirrored...."?
>
> With data compression used on most modems, tar doesn't save much
> bandwidth anymore.  Since disk space is cheap, the major cost would be
> the bandwidth costs for updating the mirror sites.  But people who
> download HOWTOs from their local mirror site (instead of from Metalab)
> would conserve bandwidth.  So I think it's a good idea.  Since
> packages of all the HOWTOs come with various distributions of Linux I
> think we should supply individual HOWTOs for downloading from the
> mirror sites.  For those who don't know, we now don't supply any
> HOWTOs for downloading from the mirror sites.  If we do this we would
> be supplying single-file HTMLs from the mirrors.

Sure we do. Currently the tar package for the HTML HOWTOs,
(the multi-file HOWTOs) are available on the mirror sites and
do not require being pulled from our host machine.


> Should they be tarred?  If not, then they could be used for on-line
> browsing and anyone who wanted a copy would use the browser to save
> it.

The tar packages are nice for those that want to grab
the entire set quickly/easily (for offline browsing or
otherwise).

> But this wastes bandwidth for those that only read a small part
> of one.  One solution is to have 2 ways to read on-line: HTML-split
> (most efficient --what we do now) and HTML-single (can easily save it
> with the browser).  To encourage efficiency, I would only say "read
> HOWTOs online" for the split ones.
>
> Another issue is that of small HOWTOs.  There isn't much point in
> splitting a 10k HOWTO up into 10@ 1k chunks.  If someone reads even
> half of this small split-HOWTO they probably use more bandwidth in
> overhead than they save by transferring less text-data.  For this case
> they are better off reading the single-HOWTO.  But how to easily
> implement this?

Not easy. The split is based on document structure (when
processed). I suppose we could do something like -

   if the file size of the single-file HTML variant is less
   then X bytes, replace the multi-file HOWTO of the same name.

I'm more inclined to remain consistent with the presentation,
however - even if that means 1K chunks (just my opinion).

> One could also argue in favor of plain text which is what I use but
> with possible future interlinking between HOWTOs, I would vote for
> HTML.
>
> Since we have been giving people only the split html-howtos for
> downloading, I guess it's OK to keep doing this.  But now we give them
> a choice.  Most people who understand this option should take the
> single html-HOWTOs since they are easier to search when using a
> browser.  Also there are a lot fewer files to find with ls (or
> locate).  Thus the distribution of the split ones should be of little
> interest.  Except that the split ones are used for on-line reading to
> conserve bandwidth and are sent to all mirror sites for this purpose.

Ok, so based on those comments, I believe that the tar package(s)
for both the split AND single-file HOWTOs should be made available
from the mirrors.

What about the single-file files themselves (not just the tar file)?
I think they should probably remain accessible only off the host
machine (and not mirrored).

> From the way it looks now in docs.html#howto most people might just
> choose "HTML" and not understand what is meant by "single-file".  Thus
> I think that the split ones need to be called "HTML - split" with the
> "single-file" ones coming first.  There are probably a few sites that
> have added useful material to the indexes that need the split ones
> (although they could generate them from the .sgml files).  Eventually,
> "HTML - single file" could be changed to just HTML since they would be
> used much more than the split ones.

Yes, but the split ones are still the primary browseable
docs, right? That's what you said earlier in your msg (if
I read it correctly).

So, in summary (correct this if I misunderstood) :

  * mirror the split HOWTOs for (primary) browsing         -- DONE
  * mirror the tar files containing the split HOWTOs       -- DONE
  * keep the single-file HOWTOs on the host only           -- DONE
  * mirror the tar files containing the single-file HOWTOs -- OPEN

regards,



-- 
Greg Ferguson     - s/w engr / mtlhd         | gferg@sgi.com
SGI Tech Pubs     - http://techpubs.sgi.com  | 
Linux Doc Project - http://www.linuxdoc.org  |


--  
To UNSUBSCRIBE, email to ldp-discuss-request@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org