Home | Contact Us | FAQ | Search & Site Map | Link to Us
Sign In | Join | Other 45 Sites in Network
Home
Discussion Groups
General
GeneralPortable MacsHardwareNetworking
Applications
Mac ApplicationsEudoraFirefox / MozillaInternet ExplorerOutlook ExpressMS OfficeEntourageExcelPowerPointWordVirtual PCMedia PlayerOther MS Products
Programming
Mac ProgrammingCodeWarriorPerl
Country Specific
Australian Mac GroupUK Mac Group

Mac Forum / Applications / Word / September 2007



Tip: Looking for answers? Try searching our database.

Paste Problem

Thread view: 
Enable EMail Alerts  Start New Thread
Thread rating: 
kevs - 25 Aug 2007 18:41 GMT
I'm pasting text from a .txt file into word and it coming up looking bad,
anything I can do?

This file originates from a pdf, which I cannot see option to get into word
direct.

Kevs

OS 10.4.7
Office 2004
JE McGimpsey - 25 Aug 2007 22:22 GMT
> I'm pasting text from a .txt file into word and it coming up looking bad,
> anything I can do?

kevs, you've been posting here long enough to know that your description
is useless in helping someone to figure out what's wrong...

What does "looking bad" mean?

A .txt file doesn't have any formatting associated with it. Have you
applied your preferred style(s) to the pasted text?
kevs - 01 Sep 2007 18:56 GMT
On 8/25/07 2:22 PM, in article
jemcgimpsey-05123C.15220625082007@msnews.microsoft.com, "JE McGimpsey"
<jemcgimpsey@mvps.org> wrote:

>> I'm pasting text from a .txt file into word and it coming up looking bad,
>> anything I can do?
[quoted text clipped - 6 lines]
> A .txt file doesn't have any formatting associated with it. Have you
> applied your preferred style(s) to the pasted text?
JE:
What's happening is I'm seeing many words bleed together after the paste.
Spacing is off.

OS 10.4.7
Office 2004
CyberTaz - 01 Sep 2007 19:14 GMT
Try Command+A, Edit> Clear> Clear Formatting and then either change the font
to one of the standards (such as Times New Roman) or apply a style.

HTH |:>)
Bob Jones
[MVP] Office:Mac

On 9/1/07 1:56 PM, in article C2FEF5C0.2A632%kevs@hotmail.com, "kevs"

> On 8/25/07 2:22 PM, in article
> jemcgimpsey-05123C.15220625082007@msnews.microsoft.com, "JE McGimpsey"
[quoted text clipped - 16 lines]
> OS 10.4.7
> Office 2004
kevs - 10 Sep 2007 23:36 GMT
On 9/1/07 11:14 AM, in article C2FF243A.2A838%onlygeneraltaz1@com.cast.net,

> Try Command+A, Edit> Clear> Clear Formatting and then either change the font
> to one of the standards (such as Times New Roman) or apply a style.
[quoted text clipped - 25 lines]
>> OS 10.4.7
>> Office 2004

Sorry Bob, just realized that the .txt file it's coming from has the word
bleed problem.  So my question is: how do I get a Word file straight from
the PDF? I don't see it offered. It offers Photoshop etc but not word? I
don't get it.

OS 10.4.7
Office 2004
Elliott Roper - 11 Sep 2007 00:10 GMT
<snip>
> Sorry Bob, just realized that the .txt file it's coming from has the word
> bleed problem.  So my question is: how do I get a Word file straight from
> the PDF? I don't see it offered. It offers Photoshop etc but not word? I
> don't get it.
PDF is not meant to be edited. You are lucky it is that good.
A PDF is only meant to look good on the page, there is no reason why
the characters could not have been placed there in any order at all in
the file, then placed just so to fool a human reader.

A neat trick for plagiarising PDF is to OCR the PDF. One of the
professional versions of Acrobat does just that, but any old OCR
program should be able to eat PDF for breakfast. If not save the thing
as a tiff, and give the OCR that to eat.

Signature

To de-mung my e-mail address:- fsnospam$elliott$$
PGP Fingerprint: 1A96 3CF7 637F 896B C810  E199 7E5C A9E4 8E59 E248

kevs - 13 Sep 2007 20:01 GMT
On 9/10/07 4:10 PM, in article 110920070010220182%nospam@yrl.co.uk, "Elliott
Roper" <nospam@yrl.co.uk> wrote:

> <snip>
>> Sorry Bob, just realized that the .txt file it's coming from has the word
[quoted text clipped - 10 lines]
> program should be able to eat PDF for breakfast. If not save the thing
> as a tiff, and give the OCR that to eat.
What is OCR?
So is there  verdict I can't get it properly in Word doc?

OS 10.4.7
Office 2004
Phillip Jones - 13 Sep 2007 21:05 GMT
OCR stands for Optical Character Recognition.

You scan document in a scanner and special software (usually doesn't
work that well PC or Mac) tries to read it and convert to an editable
document. Once converted can be opened by Word, or excel depending upon
what the document is.

> On 9/10/07 4:10 PM, in article 110920070010220182%nospam@yrl.co.uk, "Elliott
> Roper" <nospam@yrl.co.uk> wrote:
[quoted text clipped - 18 lines]
> OS 10.4.7
> Office 2004

Signature

------------------------------------------------------------------------
Phillip M. Jones, CET   |LIFE MEMBER: VPEA ETA-I, NESDA, ISCET, Sterling
616 Liberty Street      |Who's Who. PHONE:276-632-5045, FAX:276-632-0868
Martinsville Va 24112   |pjones@kimbanet.com, ICQ11269732, AIM pjonescet
------------------------------------------------------------------------

If it's "fixed", don't "break it"!

mailto:pjones@kimbanet.com

<http://www.kimbanet.com/~pjones/default.htm>
<http://www.kimbanet.com/~pjones/90th_Birthday/index.htm>
<http://www.kimbanet.com/~pjones/Fulcher/default.html>
<http://www.kimbanet.com/~pjones/Harris/default.htm>
<http://www.kimbanet.com/~pjones/Jones/default.htm>

<http://www.vpea.org>

Elliott Roper - 13 Sep 2007 22:48 GMT
> OCR stands for Optical Character Recognition.
>
> You scan document in a scanner and special software (usually doesn't
> work that well PC or Mac) tries to read it and convert to an editable
> document. Once converted can be opened by Word, or excel depending upon
> what the document is.
Well, it beats the B'Jasus out of typing it all back in again!
I've got an ancient copy of ReadIris that came on a magazine CD.
It works a treat when all else fails. PDFs, as long as they are not
just containers for scruffy bitmap images of text as some of them are,
are a great starting point for OCR. You don't need to print and scan
the little snivellers. Any halfway decent OCR program will work from
any kind of image.

Signature

To de-mung my e-mail address:- fsnospam$elliott$$
PGP Fingerprint: 1A96 3CF7 637F 896B C810  E199 7E5C A9E4 8E59 E248

Phillip Jones - 14 Sep 2007 20:00 GMT
>> OCR stands for Optical Character Recognition.
>>
[quoted text clipped - 9 lines]
> the little snivellers. Any halfway decent OCR program will work from
> any kind of image.

While OCR does work when through there is a lot of editing and spell
checking to be done. OCR is a work in progress and will be years until
almost perfect.

Signature

------------------------------------------------------------------------
Phillip M. Jones, CET   |LIFE MEMBER: VPEA ETA-I, NESDA, ISCET, Sterling
616 Liberty Street      |Who's Who. PHONE:276-632-5045, FAX:276-632-0868
Martinsville Va 24112   |pjones@kimbanet.com, ICQ11269732, AIM pjonescet
------------------------------------------------------------------------

If it's "fixed", don't "break it"!

mailto:pjones@kimbanet.com

<http://www.kimbanet.com/~pjones/default.htm>
<http://www.kimbanet.com/~pjones/90th_Birthday/index.htm>
<http://www.kimbanet.com/~pjones/Fulcher/default.html>
<http://www.kimbanet.com/~pjones/Harris/default.htm>
<http://www.kimbanet.com/~pjones/Jones/default.htm>

<http://www.vpea.org>

Clive Huggan - 15 Sep 2007 00:31 GMT
On 15/9/07 5:00 AM, in article #3tD$Fw9HHA.1416@TK2MSFTNGP03.phx.gbl,

>>> OCR stands for Optical Character Recognition.
>>>
[quoted text clipped - 13 lines]
> checking to be done. OCR is a work in progress and will be years until
> almost perfect.

That's not my experience with OmniPage Pro X, Phillip -- whether recognizing
directly from a scan or via PDF -- unless the original is say, half the
quality of a fax sent on standard setting.  In fact I'm amazed at how often
it interprets marginal-quality text correctly. That's why I quite frequently
use OCR to suck text off scanned PDFs, as Elliott describes.

Cheers,

Clive
======
kevs - 23 Sep 2007 20:13 GMT
On 9/14/07 4:31 PM, in article
C31156F9.2FBAE%REMOVETHISoffice@ANDTHISstrategists.com.au, "Clive Huggan"
<REMOVETHISoffice@ANDTHISstrategists.com.au> wrote:

> On 15/9/07 5:00 AM, in article #3tD$Fw9HHA.1416@TK2MSFTNGP03.phx.gbl,
>
[quoted text clipped - 26 lines]
> Clive
> ======

So an OCR can work right off a PDF? (no scanning needed)  Any
recommendations?

OS 10.4.7
Office 2004
Clive Huggan - 23 Sep 2007 22:41 GMT
On 24/9/07 5:13 AM, in article C31C0907.2B170%kevs@hotmail.com, "kevs"

> On 9/14/07 4:31 PM, in article
> C31156F9.2FBAE%REMOVETHISoffice@ANDTHISstrategists.com.au, "Clive Huggan"
[quoted text clipped - 36 lines]
> OS 10.4.7
> Office 2004

As I said, Kevs, OmniPage Pro X reads PDFs without having to scan.  I don't
have recent experience of any other OCR applications.  The Mac version
hasn't been updated for some years, whereas the PC version is being
continually improved, but it does what I want, to the fairly high standard I
need.

Clive Huggan
============
kevs - 26 Sep 2007 04:18 GMT
On 9/23/07 2:41 PM, in article
C31D1AC5.303E3%REMOVETHISoffice@ANDTHISstrategists.com.au, "Clive Huggan"
<REMOVETHISoffice@ANDTHISstrategists.com.au> wrote:

> On 24/9/07 5:13 AM, in article C31C0907.2B170%kevs@hotmail.com, "kevs"
>
[quoted text clipped - 47 lines]
> Clive Huggan
> ============

Thanks Clive, just version tracked it, and yes, way out of date and silly
overpriced. Any other ideas?

OS 10.4.7
Office 2004
 
Sign In
Join
My Latest Posts
My Monitored Threads
My Blog
My Photo Gallery
My Profile
My Homepage

Start New Thread
Enable EMail Alerts
Rate this Thread



©2008 Advenet LLC   Privacy Policy - Terms of Use
This website includes both content owned or controlled by Advenet as well as content owned or controlled by third parties.