Paste Problem
|
|
Thread rating:  |
kevs - 25 Aug 2007 18:41 GMT I'm pasting text from a .txt file into word and it coming up looking bad, anything I can do?
This file originates from a pdf, which I cannot see option to get into word direct.
Kevs
OS 10.4.7 Office 2004
JE McGimpsey - 25 Aug 2007 22:22 GMT > I'm pasting text from a .txt file into word and it coming up looking bad, > anything I can do? kevs, you've been posting here long enough to know that your description is useless in helping someone to figure out what's wrong...
What does "looking bad" mean?
A .txt file doesn't have any formatting associated with it. Have you applied your preferred style(s) to the pasted text?
kevs - 01 Sep 2007 18:56 GMT On 8/25/07 2:22 PM, in article jemcgimpsey-05123C.15220625082007@msnews.microsoft.com, "JE McGimpsey" <jemcgimpsey@mvps.org> wrote:
>> I'm pasting text from a .txt file into word and it coming up looking bad, >> anything I can do? [quoted text clipped - 6 lines] > A .txt file doesn't have any formatting associated with it. Have you > applied your preferred style(s) to the pasted text? JE: What's happening is I'm seeing many words bleed together after the paste. Spacing is off.
OS 10.4.7 Office 2004
CyberTaz - 01 Sep 2007 19:14 GMT Try Command+A, Edit> Clear> Clear Formatting and then either change the font to one of the standards (such as Times New Roman) or apply a style.
HTH |:>) Bob Jones [MVP] Office:Mac
On 9/1/07 1:56 PM, in article C2FEF5C0.2A632%kevs@hotmail.com, "kevs"
> On 8/25/07 2:22 PM, in article > jemcgimpsey-05123C.15220625082007@msnews.microsoft.com, "JE McGimpsey" [quoted text clipped - 16 lines] > OS 10.4.7 > Office 2004 kevs - 10 Sep 2007 23:36 GMT On 9/1/07 11:14 AM, in article C2FF243A.2A838%onlygeneraltaz1@com.cast.net,
> Try Command+A, Edit> Clear> Clear Formatting and then either change the font > to one of the standards (such as Times New Roman) or apply a style. [quoted text clipped - 25 lines] >> OS 10.4.7 >> Office 2004 Sorry Bob, just realized that the .txt file it's coming from has the word bleed problem. So my question is: how do I get a Word file straight from the PDF? I don't see it offered. It offers Photoshop etc but not word? I don't get it.
OS 10.4.7 Office 2004
Elliott Roper - 11 Sep 2007 00:10 GMT <snip>
> Sorry Bob, just realized that the .txt file it's coming from has the word > bleed problem. So my question is: how do I get a Word file straight from > the PDF? I don't see it offered. It offers Photoshop etc but not word? I > don't get it. PDF is not meant to be edited. You are lucky it is that good. A PDF is only meant to look good on the page, there is no reason why the characters could not have been placed there in any order at all in the file, then placed just so to fool a human reader.
A neat trick for plagiarising PDF is to OCR the PDF. One of the professional versions of Acrobat does just that, but any old OCR program should be able to eat PDF for breakfast. If not save the thing as a tiff, and give the OCR that to eat.
 Signature To de-mung my e-mail address:- fsnospam$elliott$$ PGP Fingerprint: 1A96 3CF7 637F 896B C810 E199 7E5C A9E4 8E59 E248
kevs - 13 Sep 2007 20:01 GMT On 9/10/07 4:10 PM, in article 110920070010220182%nospam@yrl.co.uk, "Elliott Roper" <nospam@yrl.co.uk> wrote:
> <snip> >> Sorry Bob, just realized that the .txt file it's coming from has the word [quoted text clipped - 10 lines] > program should be able to eat PDF for breakfast. If not save the thing > as a tiff, and give the OCR that to eat. What is OCR? So is there verdict I can't get it properly in Word doc?
OS 10.4.7 Office 2004
Phillip Jones - 13 Sep 2007 21:05 GMT OCR stands for Optical Character Recognition.
You scan document in a scanner and special software (usually doesn't work that well PC or Mac) tries to read it and convert to an editable document. Once converted can be opened by Word, or excel depending upon what the document is.
> On 9/10/07 4:10 PM, in article 110920070010220182%nospam@yrl.co.uk, "Elliott > Roper" <nospam@yrl.co.uk> wrote: [quoted text clipped - 18 lines] > OS 10.4.7 > Office 2004
 Signature ------------------------------------------------------------------------ Phillip M. Jones, CET |LIFE MEMBER: VPEA ETA-I, NESDA, ISCET, Sterling 616 Liberty Street |Who's Who. PHONE:276-632-5045, FAX:276-632-0868 Martinsville Va 24112 |pjones@kimbanet.com, ICQ11269732, AIM pjonescet ------------------------------------------------------------------------
If it's "fixed", don't "break it"!
mailto:pjones@kimbanet.com
<http://www.kimbanet.com/~pjones/default.htm> <http://www.kimbanet.com/~pjones/90th_Birthday/index.htm> <http://www.kimbanet.com/~pjones/Fulcher/default.html> <http://www.kimbanet.com/~pjones/Harris/default.htm> <http://www.kimbanet.com/~pjones/Jones/default.htm>
<http://www.vpea.org>
Elliott Roper - 13 Sep 2007 22:48 GMT > OCR stands for Optical Character Recognition. > > You scan document in a scanner and special software (usually doesn't > work that well PC or Mac) tries to read it and convert to an editable > document. Once converted can be opened by Word, or excel depending upon > what the document is. Well, it beats the B'Jasus out of typing it all back in again! I've got an ancient copy of ReadIris that came on a magazine CD. It works a treat when all else fails. PDFs, as long as they are not just containers for scruffy bitmap images of text as some of them are, are a great starting point for OCR. You don't need to print and scan the little snivellers. Any halfway decent OCR program will work from any kind of image.
 Signature To de-mung my e-mail address:- fsnospam$elliott$$ PGP Fingerprint: 1A96 3CF7 637F 896B C810 E199 7E5C A9E4 8E59 E248
Phillip Jones - 14 Sep 2007 20:00 GMT >> OCR stands for Optical Character Recognition. >> [quoted text clipped - 9 lines] > the little snivellers. Any halfway decent OCR program will work from > any kind of image. While OCR does work when through there is a lot of editing and spell checking to be done. OCR is a work in progress and will be years until almost perfect.
 Signature ------------------------------------------------------------------------ Phillip M. Jones, CET |LIFE MEMBER: VPEA ETA-I, NESDA, ISCET, Sterling 616 Liberty Street |Who's Who. PHONE:276-632-5045, FAX:276-632-0868 Martinsville Va 24112 |pjones@kimbanet.com, ICQ11269732, AIM pjonescet ------------------------------------------------------------------------
If it's "fixed", don't "break it"!
mailto:pjones@kimbanet.com
<http://www.kimbanet.com/~pjones/default.htm> <http://www.kimbanet.com/~pjones/90th_Birthday/index.htm> <http://www.kimbanet.com/~pjones/Fulcher/default.html> <http://www.kimbanet.com/~pjones/Harris/default.htm> <http://www.kimbanet.com/~pjones/Jones/default.htm>
<http://www.vpea.org>
Clive Huggan - 15 Sep 2007 00:31 GMT On 15/9/07 5:00 AM, in article #3tD$Fw9HHA.1416@TK2MSFTNGP03.phx.gbl,
>>> OCR stands for Optical Character Recognition. >>> [quoted text clipped - 13 lines] > checking to be done. OCR is a work in progress and will be years until > almost perfect. That's not my experience with OmniPage Pro X, Phillip -- whether recognizing directly from a scan or via PDF -- unless the original is say, half the quality of a fax sent on standard setting. In fact I'm amazed at how often it interprets marginal-quality text correctly. That's why I quite frequently use OCR to suck text off scanned PDFs, as Elliott describes.
Cheers,
Clive ======
kevs - 23 Sep 2007 20:13 GMT On 9/14/07 4:31 PM, in article C31156F9.2FBAE%REMOVETHISoffice@ANDTHISstrategists.com.au, "Clive Huggan" <REMOVETHISoffice@ANDTHISstrategists.com.au> wrote:
> On 15/9/07 5:00 AM, in article #3tD$Fw9HHA.1416@TK2MSFTNGP03.phx.gbl, > [quoted text clipped - 26 lines] > Clive > ====== So an OCR can work right off a PDF? (no scanning needed) Any recommendations?
OS 10.4.7 Office 2004
Clive Huggan - 23 Sep 2007 22:41 GMT On 24/9/07 5:13 AM, in article C31C0907.2B170%kevs@hotmail.com, "kevs"
> On 9/14/07 4:31 PM, in article > C31156F9.2FBAE%REMOVETHISoffice@ANDTHISstrategists.com.au, "Clive Huggan" [quoted text clipped - 36 lines] > OS 10.4.7 > Office 2004 As I said, Kevs, OmniPage Pro X reads PDFs without having to scan. I don't have recent experience of any other OCR applications. The Mac version hasn't been updated for some years, whereas the PC version is being continually improved, but it does what I want, to the fairly high standard I need.
Clive Huggan ============
kevs - 26 Sep 2007 04:18 GMT On 9/23/07 2:41 PM, in article C31D1AC5.303E3%REMOVETHISoffice@ANDTHISstrategists.com.au, "Clive Huggan" <REMOVETHISoffice@ANDTHISstrategists.com.au> wrote:
> On 24/9/07 5:13 AM, in article C31C0907.2B170%kevs@hotmail.com, "kevs" > [quoted text clipped - 47 lines] > Clive Huggan > ============ Thanks Clive, just version tracked it, and yes, way out of date and silly overpriced. Any other ideas?
OS 10.4.7 Office 2004
|
|
|