Home | Contact Us | FAQ | Search & Site Map | Link to Us
Sign In | Join | Other 45 Sites in Network
Home
Discussion Groups
General
GeneralPortable MacsHardwareNetworking
Applications
Mac ApplicationsEudoraFirefox / MozillaInternet ExplorerOutlook ExpressMS OfficeEntourageExcelPowerPointWordVirtual PCMedia PlayerOther MS Products
Programming
Mac ProgrammingCodeWarriorPerl
Country Specific
Australian Mac GroupUK Mac Group

Mac Forum / Applications / Eudora / April 2008



Tip: Looking for answers? Try searching our database.

Odysseus update, change in file storage

Thread view: 
Enable EMail Alerts  Start New Thread
Thread rating: 
R. Millstein - 31 Mar 2008 00:45 GMT
Hi all,

For those of you not following the Odysseus forums (Odysseus being a
project from Infinity Data Systems, attempting to recreate the features
of Eudora), there are two pieces of news:

1. There have been a couple of betas, with basic functionality.  Version
1.0 is promised for May 12.  (Many of us are skeptical that this date
will be kept, but what software is ever delivered on time?)

See <http://www.infinitydatasystems.com/products/odysseus/index.html>

2. They are changing from a SQLite Hybrid Method (their initial plan,
and what they are currently using in the betas) to a maildir method (for
beta 0.5 and beyond), rather than the mbox that Eudora currently uses.

See <http://www.infinitydatasystems.com/forums/viewtopic.php?f=3&t=223>

Roberta
Signature

Roberta Millstein
usenet@spamaway.rlm.net
Remove "spamaway" to reply

--
Posted via a free Usenet account from http://www.teranews.com

Martin S. - 31 Mar 2008 13:30 GMT
Thanks for the info.

I admit I'm not very technically minded, so I'm wondering what all of
that means for practical use.

Are you on the beta programme? If so how do you like it?

Signature

Cheers  Martin

R. Millstein - 02 Apr 2008 02:09 GMT
> I admit I'm not very technically minded, so I'm wondering what all of
> that means for practical use.

Well, if you go to the thread (again, that's
<http://www.infinitydatasystems.com/forums/viewtopic.php?f=2&t=223> ),
you will see there is some disagreement on that point.  But reading
through it might give you a sense of whether the concerns raised would
apply to you.

> Are you on the beta programme? If so how do you like it?

I am, but the last beta came out when I was away at a conference, and by
the time I was able to turn my attention to it, there were already a
gazillion messages to wade through.  So, I'm going to wait for the next
round.  My general impression is that people are pleased with what they
saw.

Roberta
Signature

Roberta Millstein
usenet@spamaway.rlm.net
Remove "spamaway" to reply

--
Posted via a free Usenet account from http://www.teranews.com

John H Meyers - 31 Mar 2008 23:06 GMT
On Sun, 30 Mar 2008 18:45:45 -0500, Roberta Millstein wrote:

> Odysseus is changing from a SQLite Hybrid Method (their initial plan,
> and what they are currently using in the betas) to a maildir method
> (for beta 0.5 and beyond), rather than the mbox that Eudora currently uses.

"Maildir" format stores one message per file;
I have heard that Odysseus will use one folder per mailbox.

IMO, Maildir organization offers some value for mail _servers_,
but is a very poor, inefficient (re most other functions)
and wasteful choice for personal computers.

When it comes time to list, view, archive, backup, copy, Spotlight
or otherwise process a large accumulation of mail over time,
in which you may have tens or hundreds of thousands
of individual files, rather than collections of messages per file
(like Mbox format), I think this will become apparent.

The article below covers some aspects used only on servers,
but this may help indicate _why_ it was developed
primarily for servers, rather than for personal computers:

http://en.wikipedia.org/wiki/Maildir

Traditional Mbox format:

http://en.wikipedia.org/wiki/Mbox

--
Elmo P. Shagnasty - 01 Apr 2008 01:33 GMT
> IMO, Maildir organization offers some value for mail _servers_,
> but is a very poor, inefficient (re most other functions)
[quoted text clipped - 5 lines]
> of individual files, rather than collections of messages per file
> (like Mbox format), I think this will become apparent.

Not with modern operating systems.

Certainly not with OS X.
John H Meyers - 01 Apr 2008 01:55 GMT
I wrote:

> IMO, Maildir organization offers some value for mail _servers_,
> but is a very poor, inefficient (re most other functions)
[quoted text clipped - 5 lines]
> of individual files, rather than collections of messages per file
> (like Mbox format), I think this will become apparent.

On Mon, 31 Mar 2008 19:33:14 -0500, Elmo wrote:

> Not with modern operating systems.
>
> Certainly not with OS X.

I'm glad to hear that processing hundreds of thousands of files
during every search, backup, archiving, copying, etc.,
does not slow down OS X, nor even require any more storage space,
but Odysseus is a cross-platform product,
which has to run even on Windows, where it very considerably does :)

The standard "zip" archive format ("Compressed folders" in Windows)
is limited to 65535 total objects, for example.

If Leopard is going to kill "Classic Eudora,"
then that's reason enough for me not to switch to a Mac at home :)

--
Elmo P. Shagnasty - 01 Apr 2008 02:07 GMT
> > Not with modern operating systems.
> >
[quoted text clipped - 3 lines]
> during every search, backup, archiving, copying, etc.,
> does not slow down OS X, nor even require any more storage space,

That's right.  It doesn't.  OS X has great file handling, and backup
programs don't scan every file to see if they've changed--backup
programs count on the OS to tell it that.

> but Odysseus is a cross-platform product,
> which has to run even on Windows, where it very considerably does :)

Doesn't Windows have a good file system?

> If Leopard is going to kill "Classic Eudora,"
> then that's reason enough for me not to switch to a Mac at home :)

I haven't switched to Mac OS 10.5, because if it ain't broke I don't fix
it.
John H Meyers - 01 Apr 2008 22:05 GMT
I wrote:

> I'm glad to hear that processing hundreds of thousands of files
> during every search, backup, archiving, copying, etc.,
> does not slow down OS X, nor even require any more storage space.

On Mon, 31 Mar 2008 20:07:40 -0500, Elmo wrote:

> That's right.  It doesn't.  OS X has great file handling, and backup
> programs don't scan every file to see if they've changed -- backup
> programs count on the OS to tell it that.

Get a stopwatch and check that out -- even checking the "last mod date/time"
(actually it should be "last inode change time") for hundreds of thousands
of files can not be instantaneous, nor is checking for the inferior
"archive flag" on hundreds of thousands of Windows files,
nor is Winzip's matching all this against the existing catalog
of a previous backup, when performing an "update," etc.

>> but Odysseus is a cross-platform product,
>> which has to run even on Windows, where it very considerably does :)

> Doesn't Windows have a good file system?

NTFS can't perform miracles, either :)

There is an overhead per file in any processing,
as well as an overhead per KB of processed content;
file systems usually also allocate in units which leave
some "slack" (unused space) in every file, plus space
which contains the characteristics of every individual file,
which makes a large collection of small files take more space
(and more time to process in any way) than a collection
of fewer combined files; for very small files,
the overhead per file may be much more than
either the space or the processing time for its content alone.

> I haven't switched to Mac OS 10.5,
> because if it ain't broke I don't fix it.

And I'm not switching from XP to Vista, not only because XP ain't broke,
but because Vista is :)

Sooner or later, however, the vendor not only stops "supporting" the older OS,
but somehow forces new software to start requiring it,
and one day the pressure is too much to continue living with,
although at different points for different folk.

--
Jim Gibson - 02 Apr 2008 00:44 GMT
> > > Not with modern operating systems.
> > >
[quoted text clipped - 7 lines]
> programs don't scan every file to see if they've changed--backup
> programs count on the OS to tell it that.

Which backup programs do that? Of the three with which I am familiar
(Retrospect, Super Duper!, and Time Machine), only Time Machine uses
functionality built in to the OS to decide which files to copy. That is
why I can run Time Machine every hour. The others will take 10-15
minutes to scan my entire disk and its ~100,000 files, even if there
are only 100 MB or so to backup. That is why I run these other programs
at night. I don't it would be possible for more than one backup program
to use OS features to determine which files have been modified. And
even one program doing that will have problems maintaining more than
one backup copy.

I try to have at least 3 different backups of my computer, because in
my experience backups are very fragile and easy to get wrong.

> I haven't switched to Mac OS 10.5, because if it ain't broke I don't fix
> it.

So do you have any experience with Time Machine?

Signature

Jim Gibson

Irritated User - 09 Apr 2008 23:34 GMT
> Doesn't Windows have a good file system?

No it certainly does not!

The Windows filing infrastructure is a pathetically limited system that
we long suffering users have had inflicted upon us and which we've
endured for years.

Where would you have me to begin?  Perhaps here:

It has:

-  A pitifully limited number of file attributes - want an off-the-cuff
list of what's missing?

-  No extensible or on-the-fly user/program/OS-defined attributing and
other such metadata.

-  No data/attribute encapsulation (attributes exist as relationships
within the O/S but many disappear the moment the file leaves the
operating system).  (Next time you see an email attachment, ask yourself
how many attributes were lost when it left its previous filing
system--or even ones before that.)

-  No facility for history /[or other]/ metadata--previous computers,
locations, user info, apps etc.--is built into the filing structure.

-  No intrinsic/built-in encryption/authentication infrastructure is
available.

-  Cannot properly handle files with filenames with greater than 254
CHR$ yet Internet Explorer can save them longer than that.  A Mexican
stand-off then exists and the file can't be moved or deleted.  /[A truly
great design faux pas.] /

-  Plain language can't always be used in filenames.  For instance,
reserved characters such as '?' or ':' cannot be used.  This stupid and
ancient anomaly is a throwback to the days of CP/M.  /[Don't say there
is no way around this as there is, moreover an example exists within XP
to show how it can be done.]   /

-  No intrinsic SQL-like database infrastructure.  /(Whatever happened
to WinFS?  With at least a wishful thought of WinFS, even Microsoft has
accepted that existing brain-dead filing system must go)./

-  Bogs down to snail's pace or slower with a 1000+ files in directory,
absolutely glacial when it has ten times this number or more.  Such
numbers are impracticably small for many applications.

-  No intrinsic CRC-like checking facility for data corruption is built
into the file structure (a file can be damaged internally yet still
appear OK to the Windows filing system).

(Here's a typical (and good) example of the problem:  large Eudora .MBX
mail files are notoriously susceptible to file corruption.  When say
your computer crashes with an open 400MB, 1GB or bigger Eudora .MBX file
then you've a pretty good chance that sections of the file will be
corrupted and you'll never find out until it's too late.  Moreover,
maintenance utilities, or say programs based on the Eudora Automation
Server, will effectively see this corruption as an EOF (end of file
marker) and truncate the file at that point--voilà, your 1GB of messages
is now, say, down to only 400MB and Eudora is none the wiser--nor are
you!  Yep, it's the Windoze apology for a filing system that's directly
responsible for that corruption.

...And I've only just begun.

Even the most lackadaisical types should have a modicum of concern about
this awful neolithic throwback.

/BTW, UNIX filing systems et al are hardly much better.
/
>:-o
R. Millstein - 10 Apr 2008 01:41 GMT
> > Doesn't Windows have a good file system?
>
> No it certainly does not!

[rant deleted]

Definitely off-topic for the Mac Eudora newsgroup.

Roberta
Signature

Roberta Millstein
usenet@spamaway.rlm.net
Remove "spamaway" to reply

--
Posted via a free Usenet account from http://www.teranews.com

Irritated User - 10 Apr 2008 04:23 GMT
>  
>>    
[quoted text clipped - 9 lines]
> Definitely off-topic for the Mac Eudora newsgroup.
>  
Not really.

Clearly, you don't have--and I'd say never seen--gigabytes of damaged
Eudora files.  I've about 35GB of .MBX files awaiting rescue all damaged
by Windows crashes and the failure of its antique filing system to cope
with such situations.

Any replacement for Eudora should:

-  be hardened against such failures (even Microsoft .PST files don't
suffer such an ill fate).

-  have vastly better mail management utilities (presently, compacting a
damaged Eudora mailbox will truncate it).  (Do you reckon this to be a
good idea?)

-  that the new mail filing system for any Eudora replacement should be
open so that 3rd party maintenance/backup/rescue are easily made
(Qualcomm never published the specifications of the .TOC index files,
this only exacerbated the problem).

Developing a mail package shouldn't just consist of counting its pretty
features, therefore discussing Eudora data security and management is
clearly a prime topic for this newsgroup.

As is usual in the software industry, those that develop software are
never the ones found mopping up the shrapnel on the bleeding edge of
user-land.  Failing to adequately consider BORING data management issues
as a fundamental part of the development cycle is no longer a
satisfactory response.  Users are demanding more, and if developers
don't adequately respond then you'll eventually find the legislators
forcing the matter with warranty legislation and software lemon laws.

Sorry if the boring bits hurt.



> Roberta
>  
Elmo P. Shagnasty - 10 Apr 2008 11:38 GMT
> > Definitely off-topic for the Mac Eudora newsgroup.
> >  
[quoted text clipped - 4 lines]
> by Windows crashes and the failure of its antique filing system to cope
> with such situations.

So how does your response address, in any way, Roberta's comment that
such discussion is definitely off-topic for the Mac Eudora newsgroup?

> Developing a mail package shouldn't just consist of counting its pretty
> features, therefore discussing Eudora data security and management is
> clearly a prime topic for this newsgroup.

The discussion at hand wasn't about Eudora data security and management;
it was about the shortcomings of the Windows filing system.

Given that, how does your response address, in any way, Roberta's
comment that such discussion is definitely off-topic for the Mac Eudora
newsgroup?

> Sorry if the boring bits hurt.

Sorry if actual relevance to the discussion at hand hurts.

I bet you have all sorts of trouble communicating with others at all
levels in your life.
Palle Jensen - 11 Apr 2008 21:58 GMT
> So how does your response address, in any way, Roberta's comment that
> such discussion is definitely off-topic for the Mac Eudora newsgroup?

In this case I suggest you should all go to the comp.mail.odysseus.mac group.

Signature

Venlig hilsen / Best regards
Palle Jensen
If you want to E-mail me then please remove the US President.

Elmo P. Shagnasty - 11 Apr 2008 23:17 GMT
> > So how does your response address, in any way, Roberta's comment that
> > such discussion is definitely off-topic for the Mac Eudora newsgroup?
>
> In this case I suggest you should all go to the comp.mail.odysseus.mac group.

As opposed to the comp.mail.eudora.mac group?

When the a.shole insists that his rant on Windows and its filing system
is on-topic for this group, how is running to another Macintosh group
going to change anything?

Answer:  it won't.
Palle Jensen - 12 Apr 2008 09:18 GMT
> As opposed to the comp.mail.eudora.mac group?

Yessirs!

> When the a.shole insists that his rant on Windows and its filing system
> is on-topic for this group, how is running to another Macintosh group
> going to change anything?

Are you tired of listening to off-topic conversations then? Guess what
I am, listening to talk about Odysseus in a mac/Eudora group.

Signature

Venlig hilsen / Best regards
Palle Jensen
If you want to E-mail me then please remove the US President.

Elmo P. Shagnasty - 12 Apr 2008 12:53 GMT
> Are you tired of listening to off-topic conversations then? Guess what
> I am, listening to talk about Odysseus in a mac/Eudora group.

You're tired of talking about the Eudora replacement in a Eudora group?
Bernd Fröhlich - 12 Apr 2008 14:38 GMT
> > Are you tired of listening to off-topic conversations then? Guess what
> > I am, listening to talk about Odysseus in a mac/Eudora group.
>
> You're tired of talking about the Eudora replacement in a Eudora group?

Umm, my Eudora replacement is Mailsmith, but I guess that doesn´t make
it on topic here ;-)
Tim Streater - 12 Apr 2008 23:27 GMT
> > > Are you tired of listening to off-topic conversations then? Guess what
> > > I am, listening to talk about Odysseus in a mac/Eudora group.
[quoted text clipped - 3 lines]
> Umm, my Eudora replacement is Mailsmith, but I guess that doesn´t make
> it on topic here ;-)

Remind me again how MailSmith is a replacement for Eudora? Does it have:

1) the ability to edit received mails/subject lines
2) a unified "Who" column
3) A "Send again" command
4) Attachments extracted
5) Automatic distinguishing of Sent mails (by making them italic) when
they are moved to another mailbox from the Out mailbox
6) A tabbed interface on the Win version (too bad the Mac one doesn't
have this)

and a trillion other features I can't remember right now?

Prolly not, which is why I stick with Eudora and am hoping that Odysseus
will have these and other Eudora features PDQ. I would say that makes
Odysseus completely on-topic for this NG, and MailSmith off, generally
speaking.
Bernd Fröhlich - 13 Apr 2008 10:07 GMT
> Remind me again how MailSmith is a replacement for Eudora? Does it have:
>
> 1) the ability to edit received mails/subject lines

I don´t need that.

> 2) a unified "Who" column

Yes

> 3) A "Send again" command

Yes

> 4) Attachments extracted

Yes

> 5) Automatic distinguishing of Sent mails (by making them italic) when
> they are moved to another mailbox from the Out mailbox

Not that I know of but since all my sent mail is in a separate folder I
don´t care.

> 6) A tabbed interface on the Win version (too bad the Mac one doesn't
> have this)

Again I don´t need that.

> and a trillion other features I can't remember right now?

Yes :-)
Tim Streater - 13 Apr 2008 11:57 GMT
> > Remind me again how MailSmith is a replacement for Eudora? Does it have:
> >
> > 1) the ability to edit received mails/subject lines
>
> I don´t need that.

I did check out MailSmith as a possible alternative to Eudora. This is a
missing killer feature.

> > 2) a unified "Who" column
>
> Yes

Except is doesn't do the right thing for sent mails moved to another
mailbox.

> > 3) A "Send again" command
>
[quoted text clipped - 9 lines]
> Not that I know of but since all my sent mail is in a separate folder I
> don´t care.

Another missing killer feature. I want to keep all mails that relate to
a topic, together.

> > 6) A tabbed interface on the Win version (too bad the Mac one doesn't
> > have this)
>
> Again I don´t need that.

It gives me quick access to the things I am working on at this moment.

> > and a trillion other features I can't remember right now?
>
> Yes :-)

Three missing killer features. Obviously not a Eudora replacement and
therefore OT. End of story.

This correspondence is now closed - Editor.
R. Millstein - 13 Apr 2008 19:13 GMT
> > > Are you tired of listening to off-topic conversations then? Guess what
> > > I am, listening to talk about Odysseus in a mac/Eudora group.
[quoted text clipped - 3 lines]
> Umm, my Eudora replacement is Mailsmith, but I guess that doesn´t make
> it on topic here ;-)

As long as I have been reading this group (a long time), there has been
discussion of other Mac email software in comparison to Eudora.  To me
that is a far cry from a discussion of the Windows file system.  So, as
far as I am concerned, discussion of Odysseus and Mailsmith in
comparison to Eudora are both legitimate.  At some point if Odysseus
really takes off, it might make sense to have a separate group for it,
but we're nowhere near that point yet.  (I don't know where the
Mailsmith people hang out).

Roberta
Signature

Roberta Millstein
usenet@spamaway.rlm.net
Remove "spamaway" to reply
** Posted from http://www.teranews.com **

FeinX_Awl-Mighty - 13 Apr 2008 19:39 GMT
> > > > Are you tired of listening to off-topic conversations then? Guess what
> > > > I am, listening to talk about Odysseus in a mac/Eudora group.
[quoted text clipped - 14 lines]
>
> Roberta

Mailsmith has not been updated for over 3 years.

To me that seems as if it is DEAD.
Signature

Keep you in the dark
You know they all pretend
Keep you in the dark
And so it all began

R. Millstein - 13 Apr 2008 21:26 GMT
In article
<kroner-30185-1EC88D.14390613042008@70-3-168-216.area5.spcsdns.net>,

> Mailsmith has not been updated for over 3 years.
>
> To me that seems as if it is DEAD.

Even with a beta as recent as April 12, 2008?

Roberta
Signature

Roberta Millstein
usenet@spamaway.rlm.net
Remove "spamaway" to reply
** Posted from http://www.teranews.com **

FeinX_Awl-Mighty - 13 Apr 2008 23:33 GMT
> In article
> <kroner-30185-1EC88D.14390613042008@70-3-168-216.area5.spcsdns.net>,
[quoted text clipped - 6 lines]
>
> Roberta

They just woke up but to me that shows lack of interest and support.
Therefore any new software can not have many expectations.

3 years to a beta...yea right
Signature

Keep you in the dark
You know they all pretend
Keep you in the dark
And so it all began

Tim Streater - 13 Apr 2008 19:51 GMT
> > > > Are you tired of listening to off-topic conversations then? Guess what
> > > > I am, listening to talk about Odysseus in a mac/Eudora group.
[quoted text clipped - 12 lines]
> but we're nowhere near that point yet.  (I don't know where the
> Mailsmith people hang out).

Well, BB runs a list but its fairly quiet. There's a beta which
addresses most of my wants except editable mails.
Palle Jensen - 12 Apr 2008 15:25 GMT
> You're tired of talking about the Eudora replacement in a Eudora group?

Actually, yes.

My Mac replaced a PC. That doesn't allow me to spam PC groups with Mac
postings.

Signature

Venlig hilsen / Best regards
Palle Jensen
If you want to E-mail me then please remove the US President.

Sander Tekelenburg - 10 Apr 2008 13:00 GMT
[...]

> > [rant deleted]
> >
> > Definitely off-topic for the Mac Eudora newsgroup.
> >  
> Not really.

*plonk*

Signature

Sander Tekelenburg, <http://www.euronet.nl/~tekelenb/>

Mac user: "Macs only have 40 viruses, tops!"
PC user: "SEE! Not even the virus writers support Macs!"

John H Meyers - 10 Apr 2008 21:06 GMT
"Irritated User" has made many mis-statements and mis-directed complaints,
but I'm sure is too busy foaming at the mouth to listen,
or to be benefited by any time invested by others in supplying useful information.

Bellyaching is commonly an inadequate response to self-caused ills,
in attempt to assign responsibility elsewhere.

--
R. Millstein - 11 Apr 2008 05:03 GMT
> >  
> >>    
> >>>      
> >>> Doesn't Windows have a good file system?
             ~~~~~~~

> >>>      
> >> No it certainly does not!
[quoted text clipped - 8 lines]
>  I've about 35GB of .MBX files awaiting rescue all damaged
> by Windows crashes and the failure of its antique filing system
    ~~~~~~~
Comp.mail.eudora.mac.  Mac-in-tosh.

Macintosh, macintosh, macintosh.  Made by that fruit company in
Cupertino.

Roberta
Signature

Roberta Millstein
usenet@spamaway.rlm.net
Remove "spamaway" to reply

--
Posted via a free Usenet account from http://www.teranews.com

Hauke Fath - 11 Apr 2008 14:04 GMT
> --------------010608090609030207040302
> Content-Type: text/html; charset=ISO-8859-1
[quoted text clipped - 6 lines]
>   <title></title>
> </head>

Please do not, I repeat, NOT post HTML to Usenet.

Thank you.

       hauke

Btw: Are you sure you own the email address
<irritated.user@whatever.com>?

Signature

Now without signature.

Bill Cole - 02 Apr 2008 18:57 GMT
> I wrote:
>
[quoted text clipped - 22 lines]
> The standard "zip" archive format ("Compressed folders" in Windows)
> is limited to 65535 total objects, for example.

And Eudora mailboxes are limited to 32767 messages.

Maildir++ (which presumably is the variant any modern mail software
would be using) does not put all messages in one directory, it splits
them by mailbox. I think Elmo is overstating the ease with which MacOS X
handles directories with very large numbers of files, but it does manage
rather well in comparison to older versions using older versions of HFS
or in comparison to FAT32 on Win2k (dunno about NTFS...) To put a random
data point with real numbers on it, this is what my nightly disk clone
said last night:

| 11:22:12 PM | Info |       Evaluated 1555307 items occupying 228.58 GB
(174396 directories, 1340085 files, 40826 symlinks)
| 11:22:12 PM | Info |       Copied  43638 items totaling  0.95 GB (418
directories, 2396 files, 40824 symlinks)
| 11:22:12 PM | Info |       Cloned  224.93 GB of data in 1290 seconds
at an effective transfer rate of 178.55 MB/s

That's on a 2GHz Core Duo (iMac) with 2GB RAM running 10.4.11 (latest
Tiger.) So, it is managing to stat well over 1000 filesystem nodes per
second.

It is easy to pick a side in the MUA storage format religious war and
ignore the serious hardness of the problem in the modern world.
Different people's priorities make different storage formats better for
them, and the scaling demands of modern users make the shortcomings of
each of the main choices intolerable for some slice of users.

Personally, I like the idea of mixed formats: mbox for archives,
something like Maildir++ for active mailboxes, and rich metadata indexes
for all mailboxes so that you don't have to go mucking around inside the
actual messages to keep status flags or search on key fields. I am
influenced by the fact that I use IMAP and run Dovecot as my server, and
it can support this sort of model.

> If Leopard is going to kill "Classic Eudora,"
> then that's reason enough for me not to switch to a Mac at home :)

Classic Eudora has a long list of issues on Tiger, they just are not
fatal. It seems to me that the Leopard issues are also non-fatal, if a
bit more annoying.

Signature

Now where did I hide that website...

R. Millstein - 03 Apr 2008 05:10 GMT
> Maildir++ (which presumably is the variant any modern mail software
> would be using) does not put all messages in one directory, it splits
[quoted text clipped - 4 lines]
> data point with real numbers on it, this is what my nightly disk clone
> said last night:

Thanks, Bill.  I was hoping you'd weigh in on this, since you've
discussed mail formats many times in the past and seem to know more
about it than most of us.  You confirmed my general impression; current
OSes and computers make things better, but there are still performance
issues to be reckoned with.

> It is easy to pick a side in the MUA storage format religious war and
> ignore the serious hardness of the problem in the modern world.
> Different people's priorities make different storage formats better for
> them, and the scaling demands of modern users make the shortcomings of
> each of the main choices intolerable for some slice of users.

And that's how the debate seems to be playing out on the IDS forums; we
have different priorities, and so argue for different things.  And some
just don't really care much one way or the other.

> Personally, I like the idea of mixed formats: mbox for archives,
> something like Maildir++ for active mailboxes, and rich metadata indexes
> for all mailboxes so that you don't have to go mucking around inside the
> actual messages to keep status flags or search on key fields. I am
> influenced by the fact that I use IMAP and run Dovecot as my server, and
> it can support this sort of model.

I wonder if IDS would consider something like that?  Seems like a
reasonable compromise.

> Classic Eudora has a long list of issues on Tiger, they just are not
> fatal. It seems to me that the Leopard issues are also non-fatal, if a
> bit more annoying.

They seem quite serious for some, and not at all for others.  So far,
I'm in the latter camp.  *knocks head*

Roberta
Signature

Roberta Millstein
usenet@spamaway.rlm.net
Remove "spamaway" to reply

--
Posted via a free Usenet account from http://www.teranews.com

John H Meyers - 03 Apr 2008 06:57 GMT
On Wed, 02 Apr 2008 23:10:29 -0500, Roberta Millstein wrote:

> [Bill Cole] confirmed my general impression;
> current OSes and computers make things better,
> but there are still performance issues to be reckoned with.

One point of misunderstanding may be in comparing a "local machine"
based protective mechanism (e.g. "Time Machine") with the general
issue of making a "portable" or "removable" archive
(I was focusing on the issue of backup and archiving).

I've probably mentioned that the "POP account" which I use with Eudora
is at Gmail.  Since all my incoming messages (and even outgoing messages,
via "Bcc:") are stored there, it can be said that "zero time" is required
by my "backup" operations, because each new message is simply added
to the already-accumulated message store, which acts as my "backup."

"Time machine" could be (not that I actually know its internals)
simply noting which files have changed, now and then "checkpointing"
the accumulated list, and paying no attention to those not receiving
any new activity.

What I'm referring to, however, is the operation of making
a complete, _removable_ archive (or backup) of all my Eudora data;
this always requires [re-]inspection of every single file,
plus an entry in a "catalog" for every single file, etc.,
which creates an unavoidable "per file" overhead in various ways
(including file "slack" space, compression inefficiency of so many
small files vs. fewer but larger files, file descriptors for every
individual file, directory entries and space for every individual file,
opening and closing every individual input and output file during copying
of an entire store, and so forth).

Searching is another activity which is strained by per-file processing;
the only way to _seem_ to compensate for that is by an extraordinary
amount of up-front indexing, which only shifts the burden around,
also requiring far more storage, plus a complex and more fragile system,
where certain activities (e.g. editing messages) tax its coherence,
and which also commonly have more limitations in searching
(e.g. only "whole words" [like Google], "words beginning with" etc.)

Even returning to the issue of local file storage, the NTFS (Windows)
file system makes an estimate, based on the size of an entire disk,
how large a "Master File Table" to originally allocate;
if the system actually decides to create vastly many more (smaller)
files than the anticipated distribution, that "MFT" may become fragmented
when its initial allocation runs out, reducing overall efficiency
(the MFT itself can not be "defragmented" during normal defragmentation);
practically anything which goes to extremes will end up falling away
from an optimum, and the "one message per file" strategy,
apparently conceived for use on servers handling dynamic,
temporary storage of transient message traffic, is pretty extreme
when it comes to being used for a long-term, permanent repository,
accumulating an enormous number of permanent messages.

Is there any successful database system, by the way,
which stores "one record per file"?

--
R. Millstein - 03 Apr 2008 19:09 GMT
> On Wed, 02 Apr 2008 23:10:29 -0500, Roberta Millstein wrote:
>
[quoted text clipped - 6 lines]
> issue of making a "portable" or "removable" archive
> (I was focusing on the issue of backup and archiving).

John, there's no misunderstanding here.  I'm referring to the discussion
over at IDS, where the claim is that there are no performance issues
with maildir that can be addressed within the email software and by
improved computing power conjoined with modern OSes.  I've been arguing
just as you are, that it depends on how one is using one's computer.  If
all you're using is Time Machine, then things are golden and maildir is
not an issue.  But if you want to create a portable archive, or do a
full backup, then maildir is going to suffer from performance problems.  

> "Time machine" could be (not that I actually know its internals)
> simply noting which files have changed, now and then "checkpointing"
> the accumulated list, and paying no attention to those not receiving
> any new activity.

Yes, that is more or less how it works.

> What I'm referring to, however, is the operation of making
> a complete, _removable_ archive (or backup) of all my Eudora data;
[quoted text clipped - 6 lines]
> opening and closing every individual input and output file during copying
> of an entire store, and so forth).

Yep, I'm with you.  But it's not me that you have to convince.

> Is there any successful database system, by the way,
> which stores "one record per file"?

An interesting question.

Roberta
Signature

Roberta Millstein
usenet@spamaway.rlm.net
Remove "spamaway" to reply

--
Posted via a free Usenet account from http://www.teranews.com

John H Meyers - 03 Apr 2008 23:22 GMT
On Thu, 03 Apr 2008 13:09:46 -0500, Roberta Millstein wrote:

JHM>> One point of misunderstanding may be in comparing a "local machine" ...

> John, there's no misunderstanding here...

> I'm with you.  But it's not me that you have to convince.

The misundertanding I meant was that Elmo's original assertions
seemed based on the "local machine" direction of thought,
and that the conflict between those assertions and my views
could be resolved by illustrating the different context.

If I happened to reply to the last post I saw and it was yours,
that was just an artifact of my "FIFO newsgroup reading" habits
(I can't even thread a needle :)

>> Is there any successful database system, by the way,
>> which stores "one record per file"?

> An interesting question.

I hoped so :)

Best.
Bill Cole - 03 Apr 2008 19:27 GMT
> On Wed, 02 Apr 2008 23:10:29 -0500, Roberta Millstein wrote:
>
[quoted text clipped - 17 lines]
> the accumulated list, and paying no attention to those not receiving
> any new activity.

FWIW, here is how it works: it hooks file system events to keep track of
what data and metadata needs to be backed up, and does the backup in
hourly batches that take no prep time because every event that results
in a need to do actual backup work has already been recorded. Similar
ideas have been around for quite a while, with things like the Veritas
storage suite's 'snapshot' features. It is my understanding that a
similar functionality is available on Windows when using NTFS, but that
may just be on the server version.

That sort of backup mustering that is done all of the time by software
that is hooked into the filesystem layer is where the future is for all
routine backup. Storage systems have become too large in general to let
the traditional model of a backup client being scanned retrospectively
for changes at backup time to survive. A growing number of OS's are
implementing ways for userspace software to receive notification of
filesystem changes, and that sort of capability can do away with the
need for a filesystem scan ahead of an incremental backup; all you need
is a resident backup client agent that gets notified on every filesystem
change.

It is also worth noting that such a model is not logically limited to
live local backup media. Supposedly Time Machine works just fine on
laptops that leave their TM disks behind for long periods and reconnect
later, although of course the granularity of the backups suffers.

> What I'm referring to, however, is the operation of making
> a complete, _removable_ archive (or backup) of all my Eudora data;
[quoted text clipped - 6 lines]
> opening and closing every individual input and output file during copying
> of an entire store, and so forth).

To some degree that's a backup implementation effect. A full backup of a
discrete filesystem (i.e. usually a disk partition or the logical
equivalent) can be a lot faster than an incremental backup of just
what's changed since the last backup, if you can afford the space and
transfer bandwidth taken by full backups. With a backup approach that
tracks changes as they happen rather than scanning for them
retrospectively, most of the effort of figuring out what to back up is
eliminated.

> Searching is another activity which is strained by per-file processing;
> the only way to _seem_ to compensate for that is by an extraordinary
[quoted text clipped - 3 lines]
> and which also commonly have more limitations in searching
> (e.g. only "whole words" [like Google], "words beginning with" etc.)

I think you are overstating this. Any mail storage model has to include
indexing of the message fields that it presents to users in a message
list, or the UI ends up painfully slow.

Message content searching is actually a pretty complex issue in regards
to how mail is stored. Search that is done inside a single-threaded MUA
is made a little easier by having a few large files rather than many
small ones because the need to open and close each file is avoided, but
when you look at the issue in relationship to the modern world of
multi-threaded apps on multi-core machines and search tools that work on
whole systems (e.g. Spotlight, Google Desktop Search, etc.) the picture
is less clear. One of the things Mac users sometimes complain about in
Eudora is that Spotlight can't be used to find text in specific
messages, and in fact there is no way to get that sort of search from
Spotlight without a file-per-message model, and it is unlikely that any
other system-wide search tool would be able to provide such a feature
without file-per-message storage.

> Even returning to the issue of local file storage, the NTFS (Windows)

Windows is off-topic here. :)

> file system makes an estimate, based on the size of an entire disk,
> how large a "Master File Table" to originally allocate;
[quoted text clipped - 8 lines]
> when it comes to being used for a long-term, permanent repository,
> accumulating an enormous number of permanent messages.

You might find it interesting that the file-per-message Maildir++ model
has been made popular primarily by the growing use of IMAP, where
servers retain messages indefinitely, not transiently.

The original Maildir model was developed as a way to simplify concurrent
access to a message store without needing to have all software aware of
a common locking mechanism. That may not strike you as relevant to a MUA
mailstore, but with multiprocessor/multicore/hyperthreaded machines
becoming the norm and desktop OS's long supporting multithreaded apps it
is actually a significant issue.

> Is there any successful database system, by the way,
> which stores "one record per file"?

It depends on your definition of "database" and "successful" I suppose.

The traditional news spool layout has retained a fair number of users
for decades (with the addition of overview indexing) in large part
because many people have found the accessibility to be useful, but also
because it avoids locking issues and allows multiple concurrent writers
to be active in the same directory (i.e. newsgroup.)

Signature

Now where did I hide that website...

John H Meyers - 04 Apr 2008 01:52 GMT
> FWIW, here is how [Time Machine] works...

Thanks for the info.

> That sort of backup mustering that is done all of the time by software
> that is hooked into the filesystem layer is where the future is for all
> routine backup...

> A full backup of a discrete filesystem (i.e. usually a disk partition
> or the logical equivalent) can be a lot faster than an incremental backup
> of just what's changed since the last backup...

> With a backup approach that tracks changes as they happen
> rather than scanning for them retrospectively, most of the effort
> of figuring out what to back up is eliminated.

Coming back down to the small task at hand, for me,
I just want an independent backup of my Eudora folder,
and even if my entire computer can be backed up over gigabit ethernet
to a Storage Area Network in the same time, I don't care :)

This is _personal_ backup, to be put on a CD,
or a USB stick, or "split" and emailed to myself at Gmail,
not a trip to Mars :)   So I use good old "zip folders,"
which is mighty universal and compatible.

If Odysseus stores one message per file,
the efficiency of my backup would sink faster than stocks in 1929,
and in fact it won't even be possible to do it any more.

>> Searching is another activity which is strained by per-file processing;
>> the only way to _seem_ to compensate for that is by an extraordinary
[quoted text clipped - 3 lines]
>> and which also commonly have more limitations in searching
>> (e.g. only "whole words" [like Google], "words beginning with" etc.)

> I think you are overstating this. Any mail storage model has to include
> indexing of the message fields that it presents to users in a message
> list, or the UI ends up painfully slow.

I have no indexes at all, other than my TOCs (which don't index
anything for searching, except that "summaries" searches the TOC itself),
and my searches are not only plenty fast enough, but I can search
even for embedded strings within words, etc.

Others who enable the "superfast X1 search" (Eudora v7 only, sorry)
write all the time about their computer "churning" and barely responding
while indexing (for rebuilding), about the huge indexes created,
about the inability to search text embedded in longer words,
about the indexes "breaking" fairly often, about losing
the flexibility to "drop in" or re-arrange mailboxes
and still keep searches working, etc.

So I like my searches better -- primitive implementation, not "super fast,"
but lowest overhead in other ways, and also most flexible.

With Odysseus' "one message per file," my searching would go down the drain,
or require me to build a highly complex and "indexed" system
for just a simple task.  Even though I have ten years' mail on hand,
Eudora's current "dumb" system works best for me.

> Message content searching is actually a pretty complex issue in regards
> to how mail is stored. Search that is done inside a single-threaded MUA
[quoted text clipped - 9 lines]
> other system-wide search tool would be able to provide such a feature
> without file-per-message storage.

I don't want a massive "system wide" tool for Eudora;
I want Eudora's own tool, which does the job for me perfectly.

"Lean and mean," less "big government bureaucracy" :)

>> Even returning to the issue of local file storage, the NTFS (Windows)..

> Windows is off-topic here. :)

There was a specific question about whether Windows' NTFS was comparable,
and Odysseus (the topic) is still cross-platform, and needs to suit all.

> You might find it interesting that the file-per-message Maildir++ model
> has been made popular primarily by the growing use of IMAP, where
> servers retain messages indefinitely, not transiently.

I did raise the point that it's more suitable for servers
than for personal computers :)

> The original Maildir model was developed as a way to simplify concurrent
> access to a message store without needing to have all software aware of
> a common locking mechanism. That may not strike you as relevant to a MUA
> mailstore...

Right :)

> but with multiprocessor/multicore/hyperthreaded machines
> becoming the norm and desktop OS's long supporting multithreaded apps
> it is actually a significant issue.

As I asked before, does this recommend the storage of database records
in a "one record per file" manner?

> The traditional news spool layout has retained a fair number of users
> for decades (with the addition of overview indexing) in large part
> because many people have found the accessibility to be useful, but also
> because it avoids locking issues and allows multiple concurrent writers
> to be active in the same directory (i.e. newsgroup.)

And they need not worry about space allocation (not using the same
physical blocks), or when they can re-use the space, how to assure
that no two write requests write into the same place, etc.?

There is always a "locking system" -- if you remove it from one place,
you have to make it re-appear in another.

Besides being again a _server_ issue, rather than a one-user-at-a-time
personal computer issue, you could do exactly the same by building
an "inner filing system" within a single OS file, which,
in effect, is what databases do, so I'm just not persuaded
that dumping the job onto the OS file system is a profound advance.

The thought to move it all to the OS file level
just doesn't grab me, but fortunately, I only "think small,"
and what I want is a personal email client that stands on its own,
doesn't make a mountain (of individual files)
out of what ain't the least broke, has fit the bill perfectly
for all this time, and don't need fixing :)

On the other hand, if we abandon the entire "mailbox" concept,
in favor of a "views" concept, like Gmail or Opera's mail,
then it becomes imperative to think of each item as independent,
absolutely requiring an extensive and more complex indexing system
(though not necessarily wasting so much space as "one message per file").

It's no longer Eudora, however, nor apparently even Odysseus.

--
Bill Cole - 04 Apr 2008 19:42 GMT
> > FWIW, here is how [Time Machine] works...
>
[quoted text clipped - 25 lines]
> the efficiency of my backup would sink faster than stocks in 1929,
> and in fact it won't even be possible to do it any more.

I guess I made my point poorly...

Your choice of a backup method for your mail is influenced by how the
mail is stored. Other methods exist that fit the bazillion-little-file
model better, and they are becoming more widespread in all sorts of
environments precisely because the numbers of files that people want to
keep backed up are exploding. On the Mac side, Time Machine is an
example, but my Windows colleagues have told me that there's some
similar thing on their side. Backup strategies traditionally have spent
a lot of compute time in order to economize on bytes transferred and
stored, but for desktop backup these days the real economy of that has
changed. I can't say what exactly the Windows alternatives are, but I'm
sure they must exist.

> >> Searching is another activity which is strained by per-file processing;
> >> the only way to _seem_ to compensate for that is by an extraordinary
[quoted text clipped - 12 lines]
> and my searches are not only plenty fast enough, but I can search
> even for embedded strings within words, etc.

Searches that are restricted to the header fields in the TOC use the TOC
as an index. Meta-data fields that only exist in the TOC use the TOC as
an index.

> Others who enable the "superfast X1 search" (Eudora v7 only, sorry)
> write all the time about their computer "churning" and barely responding
[quoted text clipped - 11 lines]
> for just a simple task.  Even though I have ten years' mail on hand,
> Eudora's current "dumb" system works best for me.

Again, I think you are overstating the slowdown from doing unindexed
searches of a large number of files. One of the reasons I use the
'tradspool' storage method (file per article) for most of my news server
is so that I can easily do arbitrary content searches (usually using
grep) across the whole spool (or in specific group subtrees) and find
specific articles, and doing so is not particularly slow.

> > Message content searching is actually a pretty complex issue in regards
> > to how mail is stored. Search that is done inside a single-threaded MUA
[quoted text clipped - 14 lines]
>
> "Lean and mean," less "big government bureaucracy" :)

And file-per-message does not prevent raw unindexed searches of all of
the message content by the MUA, if that's what one prefers. different
people prefer different approaches, and where mbox makes finding a
specific message outside of the MUA (or some other mbox-aware tool)
impossible, file-per-message allows anything that can read a text file
to do a search and find specific messages.

> >> Even returning to the issue of local file storage, the NTFS (Windows)..
>
[quoted text clipped - 9 lines]
> I did raise the point that it's more suitable for servers
> than for personal computers :)

I was trying to address your conjecture that it was designed to fit
transient spools rather than permanent storage.

> > The original Maildir model was developed as a way to simplify concurrent
> > access to a message store without needing to have all software aware of
[quoted text clipped - 9 lines]
> As I asked before, does this recommend the storage of database records
> in a "one record per file" manner?

If you don't want to implement a database that has record-level locking,
and want to allow access from other tools that don't know about such
locking or some internal data structure, yes.

> > The traditional news spool layout has retained a fair number of users
> > for decades (with the addition of overview indexing) in large part
[quoted text clipped - 5 lines]
> physical blocks), or when they can re-use the space, how to assure
> that no two write requests write into the same place, etc.?

Right. Filesystem implementations make certain operations 'atomic' from
the perspective of userspace software.

> There is always a "locking system" -- if you remove it from one place,
> you have to make it re-appear in another.

Right, and with file-per-message most locking issues are pushed down
into the filesystem so that userspace processes never see an
inconsistent state. For example, moving a message between mailboxes is
just a matter of changing which directory the file is linked into, and
userspace code cannot catch that procedure in a half-done state.

> Besides being again a _server_ issue, rather than a one-user-at-a-time
> personal computer issue,

One user at a time does not mean one task at a time. The clunky
'background' mail checking in Eudora is an example, and it is degraded
by having to manipulate messages in mbox files rather than files in
directories. Similar issues can occur outside of the MUA as well, since
things like backup software and AV scanners can catch an mbox mid-write
or out of sync with its TOC.

> you could do exactly the same by building
> an "inner filing system" within a single OS file, which,
> in effect, is what databases do, so I'm just not persuaded
> that dumping the job onto the OS file system is a profound advance.

That's another approach, but doing that moves a MUA in the direction of
Outlook, with its opaque .pst files that nothing but Outlook can touch.

> The thought to move it all to the OS file level
> just doesn't grab me, but fortunately, I only "think small,"
> and what I want is a personal email client that stands on its own,
> doesn't make a mountain (of individual files)
> out of what ain't the least broke, has fit the bill perfectly
> for all this time, and don't need fixing :)

And we come back to my original point: the different approaches to mail
storage all have their own advantages and disadvantages. Maildir-like
models may waste a little allocation block space and complicate
filesystem structures, but they can make multi-threaded MUA's smoother,
bind metadata tightly to the messages, and open up the mail store to
other tools. On the other hand, mbox storage forces the MUA to keep a
separate record of where messages sit in the file and per-message
metadata, and while they reduce filesystem complexity and theoretically
waste less allocated but not used space, in practice they usually are
not actually compacted on every message delete, so they have wastage
internally that is hard to see. Purpose-built database-like storage can
be very fast, efficient with space, support multi-threaded access and
provide neat tricks like 'views' but all of that comes at the cost of
hiding the messages from other tools.

> On the other hand, if we abandon the entire "mailbox" concept,
> in favor of a "views" concept, like Gmail or Opera's mail,
[quoted text clipped - 3 lines]
>
> It's no longer Eudora, however, nor apparently even Odysseus.

Signature

Now where did I hide that website...

John H Meyers - 13 Apr 2008 13:10 GMT
> File-per-message does not prevent raw unindexed searches of all of
> the message content by the MUA, if that's what one prefers. different
> people prefer different approaches, and where mbox makes finding a
> specific message outside of the MUA (or some other mbox-aware tool)
> impossible, file-per-message allows anything that can read a text file
> to do a search and find specific messages.

I think that's a "red herring."

Windows can use "handlers" to look inside of anything;
for example, some searches in XP look inside "zip" files by default
(much to the annoyance of many users, but one can obtain "tweaks"
to turn that off); other OS searches (for text) bypass certain files
by default, including Eudora's mailboxes -- but that can likewise
be turned back on, and since Eudora's mailboxes are just text,
it is perfectly possible to search within them for specific messages,
with no further "awareness" of "mbox format" than what you see
in front of your eyes in any text viewer.

No tool that is unaware of the MUA and its file structure
(or "handler" for access by external programs)
will fully replace the MUA's own tools, no matter what,
because it probably won't interpret HTML,
nor MIME encoded embedded objects, nor meta-data about messages
(e.g. status, labels, separate attachments, etc.)

I'm too lazy to further address other points (do I hear a sigh of relief? :)

--
Bill Cole - 14 Apr 2008 05:25 GMT
> > File-per-message does not prevent raw unindexed searches of all of
> > the message content by the MUA, if that's what one prefers. different
[quoted text clipped - 6 lines]
>
> Windows can use "handlers" to look inside of anything;

Completely irrelevant here. Frankly, I don't touch Windows when not paid
to do so. I regret the fact that Qualcomm fell into the trap of porting
Eudora to Windows, and my skepticism about Odysseus is grounded in the
fact that it is targeting the Windows commercial MUA market.

> for example, some searches in XP look inside "zip" files by default
> (much to the annoyance of many users, but one can obtain "tweaks"
[quoted text clipped - 4 lines]
> with no further "awareness" of "mbox format" than what you see
> in front of your eyes in any text viewer.

You didn't read all of what I wrote carefully enough. Understandable,
given my verbosity....

Sure, I can find any string inside a Eudora mailbox in Spotlight. It
will happily tell me which of the 269 mailboxes in my mail archives hold
some string, and that narrows my hunt to ~6MB per mailbox... So I can
use Eudora to search a subset.

Compare that to the Apple Mail experience. Spotlight will identify the
individual message, because messages are files. One search kicks up each
message with the search string, rather than a collection of mailboxes
with thousands of messages each.

> No tool that is unaware of the MUA and its file structure
> (or "handler" for access by external programs)
> will fully replace the MUA's own tools, no matter what,
> because it probably won't interpret HTML,
> nor MIME encoded embedded objects, nor meta-data about messages
> (e.g. status, labels, separate attachments, etc.)

Well, as far as the relevance here is concerned, even if one wanted to
create a special mail handler for Spotlight, there is no way to define
metadata in a handler that applies to a part of a file. In principle,
metadata that is applicable to a whole file can be exposed to Spotlight,
if there is some way to determine it from or attach it to the file.

> I'm too lazy to further address other points (do I hear a sigh of relief? :)

Yeah, well, I would find squabbling with me tiresome too....

Signature

Now where did I hide that website...

John H Meyers - 15 Apr 2008 15:30 GMT
> Sure, I can find any string inside a Eudora mailbox in Spotlight.
> It will happily tell me which of the 269 mailboxes in my mail archives
> hold some string, and that narrows my hunt to ~6MB per mailbox...
> So I can use Eudora to search a subset.

Or you can use any plain text viewer to find your string in that mailbox,
and there's (are) your message(s), exactly as you would find below, no?

And of course a little script can automate the whole thing.

> Compare that to the Apple Mail experience. Spotlight will identify the
> individual message, because messages are files. One search kicks up each
> message with the search string, rather than a collection of mailboxes
> with thousands of messages each.

So will super-fast "X1" (or a script, as above) in that "other OS,"
and without needing to split one's mail store into hundreds of thousands
of pieces to do it -- just like I'd rather have a CAT scan
than a dissection :)

I just can't wait to see multi-million record database searches
all based on Spotlight -- perhaps better than Google?  :)

--
Bill Cole - 16 Apr 2008 03:10 GMT
> > Sure, I can find any string inside a Eudora mailbox in Spotlight.
> > It will happily tell me which of the 269 mailboxes in my mail archives
[quoted text clipped - 3 lines]
> Or you can use any plain text viewer to find your string in that mailbox,
> and there's (are) your message(s), exactly as you would find below, no?

From a user interaction perspective? Not at all.

> And of course a little script can automate the whole thing.

That's of interest to scripters, but that's a pretty small niche.

Signature

Now where did I hide that website...

John H Meyers - 16 Apr 2008 23:34 GMT
On Tue, 15 Apr 2008 21:10:55 -0500:

>> And of course a little script can automate the whole thing.

> That's of interest to scripters, but that's a pretty small niche.

Scripts are posted all the time, one person write, all others use :)

--
John H Meyers - 03 Apr 2008 06:08 GMT
> Eudora mailboxes are limited to 32767 messages.

On Mac; fixed on Windows.

In archives (e.g. zip) it's _files_ (mailboxes)
which are counted, however, not messages,
so multi-message mailboxes (even a mere 32767 messages each)
raise the limit on "messages per archive"
by a factor of tens of thousands of times as many,
compared to the limit using "one message per file."

The latter runs into a brick wall at 65535 objects
per "classic" zip archive (limit removed
by less universal "extended" formats tied to specific vendors,
each of which "encourages" everyone else to adopt theirs :)

http://schmidt.devlib.org/file-formats/zip-archive-file-format.html
http://www.honeynet.org/scans/scan24/sol/pedram/reference/zip-archive-file-forma
t.html


--
Bill Cole - 03 Apr 2008 17:02 GMT
> > Eudora mailboxes are limited to 32767 messages.
>
> On Mac; fixed on Windows.

Which is off-topic in *this* newsgroup, but it is worth mentioning in a
context that might be on-topic here that the TOC differences break
cross-platform portability of Eudora mailboxes, so that if a former
Windows user wants to bring very large Eudora mail archives over to Mac,
they may need to be broken into smaller mailboxes.

(and yes, I am one of those people who believe that 1994 was a tragic
year for Eudora... )

> In archives (e.g. zip) it's _files_ (mailboxes)
> which are counted, however, not messages,
[quoted text clipped - 11 lines]
> http://www.honeynet.org/scans/scan24/sol/pedram/reference/zip-archive-file-for
> mat.html

It is odd that both of those say nothing about a files/archive limit,
when that's what you are talking about. And by the way, you are not
quite correct.

The definitive description of the Zip format is
http://www.pkware.com/documents/casestudies/APPNOTE.TXT, which describes
the Zip64 extensions to support largefiles and a 64-bit files/archive
field. Zip64 is *NOT* legally encumbered, and there are free (public
domain Java and BSD-licensed C) implementations available so that anyone
who uses Zip and wants to get past the 64k limit but lacks adequate
programming skills (e.g. the guys up in Redmond) can freely borrow the
work done by their betters.

I should add that I don't think that the *technical* hard limits on the
number of messages in a mailbox (no matter how that is implemented) are
really significant in a MUA to 99.9% of users, because the human factors
kick in at much lower levels than 32k messages in one place and drive
most people to subdivide even historical archives into smaller sets that
they can work with and not feel lost in. Once in a while someone shows
up here and moans about the 32k messages/mailbox problem, but it always
seems to be someone trying to make Eudora into a data mining tool or
some other sort of gross misapplication. For the most part, people do
not want or need their MUA to present a collection of tens of thousands
of messages to them in one flat list, so hard technical limits built
into a storage model that end up only limiting that sort of use don't
mean a lot pragmatically.

Signature

Now where did I hide that website...

John H Meyers - 04 Apr 2008 00:47 GMT
>> > Eudora mailboxes are limited to 32767 messages.

>> On Mac; fixed on Windows.

> Which is off-topic in *this* newsgroup

Odysseus (thread topic) is cross-platform,
so its issues and comparisons may also be.

> but it is worth mentioning in a
> context that might be on-topic here that the TOC differences break
> cross-platform portability of Eudora mailboxes

What about the line endings (which, if they _have_ to be changed,
necessarily also change relative message offsets in the TOC)?

One might have wished that each program could have accepted
all sensible line endings; would that have made compatibility possible?

I don't know what removed the 32767 limit from Windows,
but a user isn't asked to reformat TOCs, and I've seen someone claim
that they can inter-operate all versions (alternately) at any time,
on the same platform of course.

> so that if a former Windows user wants to bring very large Eudora mail archives
> over to Mac, they may need to be broken into smaller mailboxes.
>
> (and yes, I am one of those people who believe that 1994 was a tragic
> year for Eudora... )

I don't know the inference, but you can enlighten me if you wish.

>> In archives (e.g. zip) it's _files_ (mailboxes)
>> which are counted, however, not messages,
[quoted text clipped - 10 lines]
>> http://schmidt.devlib.org/file-formats/zip-archive-file-format.html
>> http://www.honeynet.org/scans/scan24/sol/pedram/reference/zip-archive-file-forma
t.html

> It is odd that both of those say nothing about a files/archive limit,
> when that's what you are talking about.

I listed those as places to find general information about zip file formats,
not intended as related only to the last prior sentence.

> And by the way, you are not quite correct.
>
> The definitive description of the Zip format is
> http://www.pkware.com/documents/casestudies/APPNOTE.TXT, which describes
> the Zip64 extensions to support largefiles and a 64-bit files/archive field.

As far as I can see, BOTH pkware AND Winzip have "defined" their own _extensions_
and each has "invited" everyone else to adopt theirs,
which is why I didn't make links to either one of them.

Now, are both company's extensions compatible?

I've thought not, but if they are, my next paragraphs are moot,
and I'll be glad of it:

Just because pkware was the earlier company does not mean that they
can control everything that becomes of material released to public domain,
any more than IBM controlled the DES encryption that they originated, etc.

But I didn't set out to argue which is "right" or "better" -- only to mention
that if "extended" formats are incompatible (and if Windows
hasn't adopted either as built-in), then "extended" zip archives
are tied (for most users who just want to use pre-packaged software)
to whichever system they make them with (or others who decide
to make themselves compatible, in case "7-Zip" etc.
join any particular bandwagon, or perhaps work with both,
joining the crowd who tend to say "I'll decode anything,
but I'll only create mine" -- which is just like email, where it's
"I'll import from any other program, but there's no export" :)

> Zip64 is *NOT* legally encumbered, and there are free (public
> domain Java and BSD-licensed C) implementations available so that anyone
> who uses Zip and wants to get past the 64k limit but lacks adequate
> programming skills (e.g. the guys up in Redmond) can freely borrow the
> work done by their betters.

Enjoy the crusade :)

> I should add that I don't think that the *technical* hard limits on the
> number of messages in a mailbox (no matter how that is implemented) are
[quoted text clipped - 9 lines]
> into a storage model that end up only limiting that sort of use don't
> mean a lot pragmatically.

I don't know where this is going, but my point, re Odysseus' file storage
(which started as the topic :) is that if IDS stores only one message per file,
the rather small limit of 65537 objects in the original zip format
will require many people to go to a less universal "extended" format
to make zip archives, while good old "Classic" Eudora's MBOX format,
even if limited to 32767 messages per mailbox, could have expanded that limit
(where "classical zip" format gives out) some tens of thousands of times over.

In other words, my whole point was that IDS is making a poor choice,
and I brought up this whole "zip business" (a common archiving method)
to illustrate one of the many reasons why it's poor.

"Remember Alice? -- this is a song about Alice!"
[Arlo Guthrie - "Alice's Restaurant"]
http://www.arlo.net/resources/lyrics/alices.shtml
http://youtube.com/watch?v=5_7C0QGkiVo
Bill Cole - 04 Apr 2008 16:24 GMT
> >> > Eudora mailboxes are limited to 32767 messages.
>
[quoted text clipped - 11 lines]
> What about the line endings (which, if they _have_ to be changed,
> necessarily also change relative message offsets in the TOC)?

Yes, if Windows mailboxes use CRLF (i.e. normal DOS/Windows  
linebreaking) instead of the bare CR used on the Mac, that would also
break portability.

> One might have wished that each program could have accepted
> all sensible line endings; would that have made compatibility possible?

In theory yes, but it is a harder thing to do than to postulate, since
it would require a strategy to deal with edge cases in messages that use
8-bit encodings and non-text content.

Eudora *could* have standardized on One True Format across platforms,
and that would have been be a simpler choice.

> I don't know what removed the 32767 limit from Windows,
> but a user isn't asked to reformat TOCs, and I've seen someone claim
> that they can inter-operate all versions (alternately) at any time,
> on the same platform of course.

On the Mac side it is a bit dumb: the MacOS Resource Manager puts a hard
limit on the size of individual resources and the full resource fork,
and the way Eudora lays out the TOC eats it all at 32k messages, largely
filled with nulls.

> > so that if a former Windows user wants to bring very large Eudora mail
> > archives
[quoted text clipped - 4 lines]
>
> I don't know the inference, but you can enlighten me if you wish.

The year Eudora was ported to Windows (and the last year that the
current source was published.)

Competing with MS on their own turf is never a wise move. Eudora was the
best Mac e-mail client and Qualcomm had adequate resources to keep it
that way, had they resisted the urge to go after the PC side of the
market. They spread their resources in a non-core area of their business
too thin and flailed around trying to flank MS with ill-considered
features that never really caught on.

> >> In archives (e.g. zip) it's _files_ (mailboxes)
> >> which are counted, however, not messages,
[quoted text clipped - 29 lines]
> and each has "invited" everyone else to adopt theirs,
> which is why I didn't make links to either one of them.

If you know of any technical spec for the WinZip extensions that are  
not part of the PKware "appnote" specification of the Zip file format, I
would love to have a link.  

> Now, are both company's extensions compatible?
>
> I've thought not, but if they are, my next paragraphs are moot,

My understanding from looking at the available documentation is that
WinZip 9 and later have supported the PKWare 64-bit extensions
completely and the differences between the PKWare and WinZip formats is
in the encryption implementations.

> and I'll be glad of it:

I think you can be glad.

> Just because pkware was the earlier company does not mean that they
> can control everything that becomes of material released to public domain,
> any more than IBM controlled the DES encryption that they originated, etc.

It's hard to make that analogy between an algorithm invented with the
specific goal of becoming a government standard and a file format
invented for use in commercial software and made open by choice.  PKWare
defined the original format and published it in full and have supported
the format as an evolving open standard that anyone can implement
freely.  

> But I didn't set out to argue which is "right" or "better" -- only to mention
> that if "extended" formats are incompatible (and if Windows
[quoted text clipped - 14 lines]
>
> Enjoy the crusade :)

No crusade involved, just noting that the problem you are citing is a
problem of one vendor being behind the times in their implementation of
an evolving open format.

My real point that lead to this tangential diversion is that the
problems specific to each of the options for how a MUA can store mail
are limited to the ways particular subsets of users work with their mail
and with their mail as part of their whole environment. The
file-per-message approach certainly does work less well with some ways
that people like to search and back up their mail. The
mbox-file-per-logical-mailbox approach used by classical Eudora presents
its own set of problems with other ways that other people prefer. There
are alternatives that may partially address all of those problems at the
cost of raising other issues such as transparency and complexity. I
think that the bottom line for any new MUA is that because all of the
solutions are imperfect, some compromise that addresses the problems of
any simplistic model has to be made or else some significant slice of
users won't use the new tool. One advantage Odysseus has over any
existing MUA is that it has no legacy other than the loose affinity to
the Eudora users, so they can pick an approach and tune it up in the
early versions rather than be bound to some historical choice.

Signature

Now where did I hide that website...

R. Millstein - 02 Apr 2008 02:10 GMT
> On Sun, 30 Mar 2008 18:45:45 -0500, Roberta Millstein wrote:
>
[quoted text clipped - 4 lines]
> "Maildir" format stores one message per file;
> I have heard that Odysseus will use one folder per mailbox.

That was the original plan, but as I wrote above, they seem to have
changed their minds.

Roberta
Signature

Roberta Millstein
usenet@spamaway.rlm.net
Remove "spamaway" to reply

--
Posted via a free Usenet account from http://www.teranews.com

 
Sign In
Join
My Latest Posts
My Monitored Threads
My Blog
My Photo Gallery
My Profile
My Homepage

Start New Thread
Enable EMail Alerts
Rate this Thread



©2008 Advenet LLC   Privacy Policy - Terms of Use
This website includes both content owned or controlled by Advenet as well as content owned or controlled by third parties.