Spotlight not indexing/searching text WITHIN .ppt or .pptx in tiger
|
|
Thread rating:  |
tkjazzer@officeformac.com - 03 Apr 2008 18:56 GMT Version: 2008 Operating System: Mac OS X 10.4 (Tiger) Processor: intel
Hello,
Spotlight is not indexing / searching text within .ppt and .pptx files (and it never has for me).
My PDFs index great and I can quickly find which lecture handout had the word "methanol" in it, but I can't quickly find the .ppt and .pptx files that had powerpoint in it.
I read on macrumors.com that indexing works for some people. Why isn't mine working? What can I do to fix it?
tkjazzer@officeformac.com - 03 Apr 2008 18:57 GMT for me, spotlight will show a .ppt file ONLY if the searched word is in the filename.
tkjazzer
Corentin Cras-Méneur - 04 Apr 2008 16:31 GMT > Spotlight is not indexing / searching text within .ppt and .pptx files > (and it never has for me). The mdimporter is actually provided through Apple by system updates. It was working fine for me for ppt files, but wasn't up to date enough to support pptx files. Since I now run Tiger, I have no idea as to whether or not it has been updated for Office 2008 files for Tiger as well. What version is the file in /Library/Spotlight???
If the ppt files are not properly indexed, then you probably have a Spotlight issue on your Mac. You might need to trigger a full re-indexing of the drive (you can do it through the command line, but I'm sure you can also find a feew freeware utiilties to do it for you on Google or VersionTracker),
Corentin
 Signature --- Mac:MS MVP http://www.cortig.net/wordpress/ --- http://www.mvps.org - http://mvp.support.microsoft.com MVPs are not MS employees - Les MVP ne travaillent pas pour MS Remove "NoSpam" to e-mail me - Retirez "NoSpam" pour m'écrire
tkjazzer@officeformac.com - 05 Apr 2008 08:19 GMT So, I put my HD and various other folders into the spotlight preferences, privacy not to index... then removed it to trigger indexing.
I then restarted and noticed spotlight was indexing.
However, the indexing finished quite quickly.
I noticed that my main folder was not indexed.
I checked the activity monitor for mdimport and it seems to have on and off activity with the .ppt files in that folder.
However, the files in that folder that are slowly coming online are only indexed by their filename - the text within the powerpoint slides is not.
for example, the first slide has the professor's first name, anthony... where no filename has his first name... and the file does not appear. the same file has the professors last name in the filename and when you type that... the file appears.
so it appears that spotlight didn't get everything when it first indexed after restart and has been slowly working in the background since.
I don't know why it just didn't get the index when it said it was goign to take 2 hours... (although it went much too fast for 2 hours)
I still can't figure out why the text inside the .ppt files is not indexed.
Do I have to open each individual .ppt so that OS Tiger 10.4.11 knows what the text is in each powerpoint?
I thank you in advance for any insight or tips or tricks,
tkjazzer@officeformac.com - 05 Apr 2008 18:11 GMT So it appears it is going to take days for mdimporter to stop working and finish indexing.
It is going at snail speed in the background, but it appears like the not indexing inside .ppt is still going to be a problem.
I have another random question about spotlight.
Before a folder shows up, does every file in the folder have to be indexed?
The one folder that it is indexing that will probably take days to finish... files inside the folder are now showing up but when I type the Name of the folder in to spotlight, it does not show up... odd.
Corentin Cras-Méneur - 05 Apr 2008 20:31 GMT > So it appears it is going to take days for mdimporter to stop working > and finish indexing. > > It is going at snail speed in the background, but it appears like the > not indexing inside .ppt is still going to be a problem. There are ways through the command line to froce-reindex a file or folder and to visualize the Spotlight index for that file, but I would like to stay away from these rather tedious methods.
Use VersionTracker to find Spotlight-related Utilities. I know some of them ca help you check what actually gets indexed for a specific file.
Reindexing could take over nights, but several days seems a little excessive.
> I have another random question about spotlight. > > Before a folder shows up, does every file in the folder have to be > indexed? Nope. A folder only shows up for "name" based searches though.
> The one folder that it is indexing that will probably take days to > finish... files inside the folder are now showing up but when I type the > Name of the folder in to spotlight, it does not show up... odd. It would tell me that there is something screwy with Spotlight on your Mac. Something is holding things up. If I were you, I'd look in /Library/Spotlight and ~/Library/Spotlight to make sure there is nothing there that could conflict (eg: 2 diffeernt versions of the same mdimporter. You can also check in Console.app if there is any trace of error with Spotlight.
make sure you update everything that can be updated on your Mac (Old versions of Stuffit for instance had awful mdimporters that were messing up more or less everything.
Check the Spotlight preferences and uncheck everything that doesn't need to be indexed.
Then you can consider disabling Spotlight and re-enabling it.
Finally you can play around in the Terminal with the mdutil (enable-disable spotlight, force-reindex an entire drive...) and mdimport (to force reindex files, check spotlight index for files...).... try man mdimport and man mdutil
first to learn more about these commands,
Corentin
 Signature --- Mac:MS MVP http://www.cortig.net/wordpress/ --- http://www.mvps.org - http://mvp.support.microsoft.com MVPs are not MS employees - Les MVP ne travaillent pas pour MS Remove "NoSpam" to e-mail me - Retirez "NoSpam" pour m'écrire
tkjazzer@officeformac.com - 06 Apr 2008 23:17 GMT just checked today and now that folder is showing up... giving it time worked for that...
but the rest of my stuff is screwy within the powerpoint files.
I will try to work through your suggestions one at a time. Thank you
tkjazzer@officeformac.com - 06 Apr 2008 23:19 GMT my microsoft office mdimporter icon does not have the "office O symbol" on it. Is that a problem?
tkjazzer@officeformac.com - 06 Apr 2008 23:58 GMT would deleting the mdimporter do anything? would Tiger then fix it and reinstall another?
I've also tried clearing my caches with the app MainMenu but that did not work.
I'm resisting learning terminal but will do if i have to... (haven't tried terminal solutions yet)
It is odd to me that it is only the powerpoint files that do it. word documents, pdfs work... just not ppt
tkjazzer@officeformac.com - 06 Apr 2008 23:59 GMT doubt i will be able to go through each app and update it. I just have too many. Unless there is an easier way of figuring out which apps are out of date.
tkjazzer@officeformac.com - 07 Apr 2008 00:09 GMT OK i keep catching mdimporter doing something in the activity monitor. Can anyone tell me what it is doing? I appears to be working with various .ppt files in the main .ppt folder. Could this be something?
/ /System/Library/Frameworks/CoreServices.framework/Versions/A/Frameworks/Metadata.framework/Versions/A/Support/mdimport /System/Library/CoreServices/CharacterSets/CFUnicodeData-L.mapping /System/Library/CoreServices/CharacterSets/CFCharacterSetBitmaps.bitmap /System/Library/CoreServices/CharacterSets/CFUniCharPropertyDatabase.data /Library/Spotlight/Microsoft Office.mdimporter/Contents/MacOS/Microsoft Office /Library/Caches/com.apple.IntlDataCache.le.sbdl.501 /Library/Caches/com.apple.LaunchServices-014501.csstore /usr/share/icu/icudt32l.dat /usr/lib/dyld /usr/lib/libSystem.B.dylib /System/Library/Frameworks/ApplicationServices.framework/Versions/A/Frameworks/CoreText.framework/Versions/A/CoreText /System/Library/Frameworks/ApplicationServices.framework/Versions/A/Frameworks/ATS.framework/Versions/A/ATS /System/Library/Frameworks/ApplicationServices.framework/Versions/A/Frameworks/CoreGraphics.framework/Versions/A/CoreGraphics /System/Library/Frameworks/CoreFoundation.framework/Versions/A/CoreFoundation /usr/lib/libicucore.A.dylib /usr/lib/libobjc.A.dylib /usr/lib/libstdc++.6.0.4.dylib /usr/lib/libgcc_s.1.dylib /usr/lib/libauto.dylib /System/Library/Frameworks/CoreServices.framework/Versions/A/Frameworks/CarbonCore.framework/Versions/A/CarbonCore /System/Library/Frameworks/CoreServices.framework/Versions/A/Frameworks/Metadata.framework/Versions/A/Metadata /System/Library/Frameworks/Security.framework/Versions/A/Security /System/Library/Frameworks/DiskArbitration.framework/Versions/A/DiskArbitration /System/Library/Frameworks/ApplicationServices.framework/Versions/A/Frameworks/ColorSync.framework/Versions/A/ColorSync /System/Library/Frameworks/ApplicationServices.framework/Versions/A/Frameworks/HIServices.framework/Versions/A/HIServices /System/Library/Frameworks/ApplicationServices.framework/Versions/A/Frameworks/LaunchServices.framework/Versions/A/LaunchServices /System/Library/Frameworks/ApplicationServices.framework/Versions/A/Frameworks/ImageIO.framework/Versions/A/Resources/libJP2.dylib /usr/lib/libxml2.2.dylib /System/Library/PrivateFrameworks/DesktopServicesPriv.framework/Versions/A/DesktopServicesPriv /System/Library/Frameworks/Foundation.framework/Versions/C/Foundation /System/Library/Frameworks/Carbon.framework/Versions/A/Frameworks/SecurityHI.framework/Versions/A/SecurityHI /System/Library/Frameworks/Carbon.framework/Versions/A/Frameworks/OpenScripting.framework/Versions/A/OpenScripting /System/Library/Frameworks/Carbon.framework/Versions/A/Frameworks/HIToolbox.framework/Versions/A/HIToolbox /dev/null /dev/null count=0, state=0x2 /tmp/com.apple.csseed.61 apple.shm.notification_center /Library/Spotlight /System/Library/Spotlight /Users/****USERNAME****/Desktop/***FOLDER NAME***/***SUB FOLDER NAME****/.DS_Store count=0, state=0x2 /Users/****USERNAME****/Desktop/***FOLDER NAME***/***SUB FOLDER NAME****/XXXXX(filename).ppt /Users/****USERNAME****/Desktop/***FOLDER NAME***/***SUB FOLDER NAME****/XXXXX(filename).ppt.ppt
both files above (XXXXX) were the same files.
tkjazzer@officeformac.com - 07 Apr 2008 00:14 GMT Can this issue be solved by the Mac Geniuses at the Mac Stores? Do they charge?
I think my computer has the 3 year apple care but I don't really know what that entitles me to.
Thank you
Corentin Cras-Méneur - 11 Apr 2008 20:15 GMT > Can this issue be solved by the Mac Geniuses at the Mac Stores? Do the > charge? They woudl probably charge, and I don;t know whether they would do that sort of thing for you,
Corentin
 Signature --- Mac:MS MVP http://www.cortig.net/wordpress/ --- http://www.mvps.org - http://mvp.support.microsoft.com MVPs are not MS employees - Les MVP ne travaillent pas pour MS Remove "NoSpam" to e-mail me - Retirez "NoSpam" pour m'écrire
tkjazzer@officeformac.com - 12 Apr 2008 22:27 GMT <http://forums.macrumors.com/showthread.php?p=5302986&posted=1#post5302986>
I've done the terminal command for number 1 which is mdimport -L
I am now trying to find the command for step 2: "2) use the command on a file to see what has been indexed for it"
thank you so much for your help so far. I really appreciate everything. I can't wait until my spotlight actually works properly.
Corentin Cras-Méneur - 16 Apr 2008 15:10 GMT Sorry, but I've been away for a few days.
> <http://forums.macrumors.com/showthread.php?p=5302986&posted=1#post53029 > 86> > > I've done the terminal command for number 1 which is mdimport -L What did you get?? Do you see the proper Office mdimporter listed in the output?? This command lists all recognize mdimporters.
> I am now trying to find the command for step 2: "2) use the command on a > file to see what has been indexed for it" lets' start over:
1) check the list of mdimporter actually recognized by the system
mdimport -L You did this one already and from what I can see form the thread you cited, it is recognized
2) use the command on a file to see what has been indexed for it
mdls <drag your file here> This command should list basic information known about a file. Dragging an Office document here should list a binch of properties. Do you get anything??
If you want to see what words are actually indexed for a specific file, you have to use this command instead: mdimport -n -d2 <drag your file here>
3) force-reindex a file or folder to see if it corrects the problem for the file or folder
mdimport <drag a file or folder here>
4) force-reindex everything if 3 worked. It will run for some time....
mdutil -E / (you might need to authenticate for this one, I'm not sure... If nothing happens, try: sudo mdutil -E / )
The Terminal fills-in paths to files through a simple drag and drop. As I indicated for many of these commands, you can drag files or folders at the end of the command to get the path to the file you are trying to play with. Of course, don;t type in "<drag a file or folder here>", just drag it :-)
Corentin
 Signature --- Mac:MS MVP http://www.cortig.net/wordpress/ --- http://www.mvps.org - http://mvp.support.microsoft.com MVPs are not MS employees - Les MVP ne travaillent pas pour MS Remove "NoSpam" to e-mail me - Retirez "NoSpam" pour m'écrire
tkjazzer@officeformac.com - 18 Apr 2008 00:49 GMT > lets' start over: Ok, I am working on step 2.
> 2) use the command on a file to see what has been indexed for it > > mdls > This command should list basic information known about a file. Dragging > an Office document here should list a binch of properties. Do you get > anything?? I first tried an office 2008 ppt file and got:
kMDItemAttributeChangeDate = 2008-04-16 10:39:04 -0700 kMDItemContentCreationDate = 2008-03-11 00:34:12 -0700 kMDItemContentModificationDate = 2008-04-16 10:38:43 -0700 kMDItemContentType = "com.microsoft.powerpoint.ppt" kMDItemContentTypeTree = ( "com.microsoft.powerpoint.ppt", "public.data", "public.item", "public.presentation", "public.composite-content", "public.content" ) kMDItemDisplayName = "080311_0800_***removedinfo****_drug_induced_liver_slides08.ppt" kMDItemFSContentChangeDate = 2008-04-16 10:38:43 -0700 kMDItemFSCreationDate = 2008-03-11 00:34:12 -0700 kMDItemFSCreatorCode = 1347441715 kMDItemFSFinderFlags = 0 kMDItemFSInvisible = 0 kMDItemFSIsExtensionHidden = 0 kMDItemFSLabel = 0 kMDItemFSName = "080311_0800_***removedinfo****_drug_induced_liver_slides08.ppt" kMDItemFSNodeCount = 0 kMDItemFSOwnerGroupID = 501 kMDItemFSOwnerUserID = 501 kMDItemFSSize = 6161920 kMDItemFSTypeCode = 1397507128 kMDItemID = 816118 kMDItemKind = "Microsoft PowerPoint document" kMDItemLastUsedDate = 2008-04-16 10:38:31 -0700 kMDItemUsedDates = (2008-03-11 00:35:26 -0700, 2008-04-15 17:00:00 -0700)
Then I dragged an office 2008 .doc file and got:
kMDItemAttributeChangeDate = 2008-04-07 18:05:07 -0700 kMDItemAuthors = ("***removedinfo****") kMDItemContentCreationDate = 2007-04-25 20:29:55 -0700 kMDItemContentModificationDate = 2007-04-25 21:13:13 -0700 kMDItemContentType = "com.microsoft.word.doc" kMDItemContentTypeTree = ("com.microsoft.word.doc", "public.data", "public.item") kMDItemDisplayName = "curriculum rep email.doc" kMDItemFSContentChangeDate = 2007-04-25 21:13:13 -0700 kMDItemFSCreationDate = 2007-04-25 20:29:55 -0700 kMDItemFSCreatorCode = 1297307460 kMDItemFSFinderFlags = 0 kMDItemFSInvisible = 0 kMDItemFSIsExtensionHidden = 0 kMDItemFSLabel = 0 kMDItemFSName = "curriculum rep email.doc" kMDItemFSNodeCount = 0 kMDItemFSOwnerGroupID = 501 kMDItemFSOwnerUserID = 501 kMDItemFSSize = 23838 kMDItemFSTypeCode = 1463304782 kMDItemID = 17775 kMDItemKind = "Microsoft Word 97 - 2004 document" kMDItemLastUsedDate = 2007-04-25 21:13:13 -0700 kMDItemTitle = "Hi," kMDItemUsedDates = (2007-04-25 21:13:13 -0700)
Then I dragged a acrobat 8 professional pdf and got:
kMDItemAttributeChangeDate = 2008-04-16 10:37:28 -0700 kMDItemAuthors = ("***removedinfo****") kMDItemContentCreationDate = 2008-04-16 10:35:49 -0700 kMDItemContentModificationDate = 2008-04-16 10:35:50 -0700 kMDItemContentType = "com.adobe.pdf" kMDItemContentTypeTree = ( "com.adobe.pdf", "public.data", "public.item", "public.composite-content", "public.content" ) kMDItemCreator = "Acrobat PDFMaker 8.1 for Word" kMDItemDisplayName = "080311_0800_***removedinfo****_drug_induced_liver_handout08.pdf" kMDItemEncodingApplications = ("Acrobat Distiller 8.1.0 (Windows)") kMDItemFSContentChangeDate = 2008-04-16 10:35:50 -0700 kMDItemFSCreationDate = 2008-04-16 10:35:49 -0700 kMDItemFSCreatorCode
tkjazzer@officeformac.com - 18 Apr 2008 00:53 GMT NO WAY, THE POST ABOVE WAS SO MUCH LONGER.
i'm quite mad at the computer right now.
in summary, step 3 didn't work.
tkjazzer@officeformac.com - 18 Apr 2008 00:55 GMT will come back to this tomorrow or the next day and show you the results of what was indexed for step 2 showing the specific words indexed. so frustrating.
tkjazzer@officeformac.com - 18 Apr 2008 00:56 GMT step 3 didn't work for the .ppt that wasn't appearing fully indexed.
Corentin Cras-Méneur - 18 Apr 2008 18:16 GMT > step 3 didn't work for the .ppt that wasn't appearing fully indexed. Well what does mdimport return?? Obviously, there is a nasty spotlight issue on your Mac :-\ A conflict maybe??
mdimport should be able to import the file. If it fails, then no need re-indexing the entire drive (since mdimport would then do the same thing, on a much larger scale, simply failing over and over again on each and every Office file).
You might want to consider booting on your MacOS X DVD to run DiskUtility from there to fix the drive, You might also want to repair Permissions from Disk Utility (booting from your own drive this time).
Corentin
 Signature --- Mac:MS MVP http://www.cortig.net/wordpress/ --- http://www.mvps.org - http://mvp.support.microsoft.com MVPs are not MS employees - Les MVP ne travaillent pas pour MS Remove "NoSpam" to e-mail me - Retirez "NoSpam" pour m'écrire
tkjazzer@officeformac.com - 18 Apr 2008 18:40 GMT OK, what happen was I made this extremely long post to put on this forum and the post got cut off and I lost like 3/4 of the post. Of course I didn't think of copying the whole thing in case it happened.
So once I need a break from studying I'll try to replicate more of the post.
So Office Word and Acrobat Profession 8 is Indexing EVERYTHING fine.
It is just ppt and pptx files.
I will show you what exactly those files are indexing later and will then proceed if you still suggest on doing the things you mentioned above.
Thank you so much for your time. I will post soon. I just am so frustrated at mactopia for cutting off my post that I need a few more hours before entering terminal land again.
tkjazzer@officeformac.com - 18 Apr 2008 18:45 GMT Oh cool, terminal saved what I did yesterday:
mdimport -n -d2 /Users/***removedinfo****/Desktop/***removedinfo****/10\ -\ G.I.\ Liver/080311_0800_***removedinfo****_drug_induced_liver_slides08.ppt 2008-04-17 16:32:28.830 mdimport[4462] Import '/Users/***removedinfo****/Desktop/***removedinfo****/10 - G.I. Liver/080311_0800_***removedinfo****_drug_induced_liver_slides08.ppt' type 'com.microsoft.powerpoint.ppt' using 'file://localhost/Library/Spotlight/Microsoft%20Office.mdimporter/' 2008-04-17 16:32:34.171 mdimport[4462] Sending attributes of '/Users/***removedinfo****/Desktop/***removedinfo****/10 - G.I. Liver/080311_0800_***removedinfo****_drug_induced_liver_slides08.ppt' to server. Attributes: '{ "_kMDItemImporterCrashed" = <null>; "com_apple_metadata_modtime" = 230060323; kMDItemAuthors = ("***removedinfo****"); kMDItemContentCreationDate = 2008-03-11 00:34:12 -0700; kMDItemContentModificationDate = 2008-04-16 10:38:43 -0700; kMDItemContentType = "com.microsoft.powerpoint.ppt"; kMDItemContentTypeTree = ( "com.microsoft.powerpoint.ppt", "public.data", "public.item", "public.presentation", "public.composite-content", "public.content" ); kMDItemDisplayName = {"" = "080311_0800_***removedinfo****_drug_induced_liver_slides08.ppt"; }; kMDItemKind = {"" = "Microsoft PowerPoint document"; }; \tFXR farnesoid X receptor\nLigand binding domain\nDNA binding domain\n Ligand binding pocket of human SXR isClotrimazole\nBile acids\nCYP3A4*\nCYP2B\nMDR1* (p-gp)\nCYP2C\***removedinfo****GI/Liver\nMRP2\nCYP1A1\nSulfotransferase and UDGPT isozymes\n* Intestinal first pass protection\nRifampin\nPXR\nCYP3A\nCYP2B\nPhenobarbital\nCAAdv Drug Delivery Reviews, 2001\nP\na\nt\nt\ne\nr\nn\nE\nf\nf\ne\nc\nt\n \no\nn\n \nt\na\nr\ng\ne\nt\nc\ne\nl\nl\nI\nn\nf\nl\na\nm\nm\na\nt\ni\no\nn\nD\nr\nu\ng\nM\ni\nc\nr\no\ns\nc\no\np\ni\nc\nc\nh\no\nl\na\nn\ng\ni\nt\ni\ns\nB\ni\nl\ne\n \nd\nu\nc\nt\n \nc\ne\nl\nl\ni\nn\nj\nu\nr\ny\nP\no\nr\nt\na\nl\n \nt\nr\ni\na\nd\nC\nh\nl\no\nr\np\nr\no\nm\na\nz\ni\nn\ne\n;\nE\nr\ny\nt\nh\nr\no\nm\ny\nc\ni\nn\ne\ns\nt\no\nl\na\nt\ne\nB\nl\na\nn\nd\nc\nh\no\nl\ne\ns\nt\na\ns\ni\ns\nI\nn\nh\ni\nb\ni\nt\ni\no\nn\n \no\nf\nh\ne\np\na\nt\no\nc\ny\nt\ne\nt\nr\na\nn\ns\np\no\nr\nt\nN\no\nn\ne\nE\ns\nt\nr\no\ng\ne\nn\n,\nC\ny\nc\nl\no\ns\np\no\nr\ni\nn\n \nA\nHow can wof cholestatic liver injury?\nBile acid independent\nBile acid excretion\nBile Flow\nBile acid dependent\nToxic Drug\nInhibition of bile acid transport (bland cholestasis)\n\Uffb1 to hepatocytes (mixed injury)\nMetabolitGSH\nBile duct cells\***removedinfo**** Div. GI/Liver\nToxic concentrations of bile salts\nBA\nHepatocyte injury (mixedBosentan\nEstradiol-17b-glucuronide\***removedinfo**** Div. GI/Liver\nMRP2\nFlucloxacillin\n5-OH-Methylflucloxacillin\nCYP3A4\nBile Duct Cells\nToxicity to bile duct cells >> hepatocytes\nLakehal et al, Chem Res Toxicol, 2001\nFaSurvivors: serum phosphate <1.2 mmol/l at 48 to 96 hours\nAPAP= acetaminophen; d/c=discontinue; NPO- nothing bmouth; prn=as needed, Rx=prescribe\n"; kMDItemTitle = "Drug-Induced Hepatotoxicity"; }'
tkjazzer@officeformac.com - 18 Apr 2008 18:47 GMT so again, I can't seem to type any words I see within that file and have them show up in the spotlight.
However, I can type a word that appears in the file name and that shows up.
The Office Word documents index fine. The PDFs index fine.
What should I do again?
Can you break it down in to steps?
Thank you!
tkjazzer@officeformac.com - 18 Apr 2008 18:47 GMT I tried to reindex that file. The exact same thing appeared, so nothing changed.
tkjazzer@officeformac.com - 18 Apr 2008 18:49 GMT ok, something did work.
but only some words inside were indexed.
so if you look up. I typed "duct cells" in spotlight which appears to have been indexed, and spotlight showed it.
SO WHY ARE ONLY SOME WORDS INSIDE THE PPT FILES BEING INDEXED WHILE OTHERS ARE NOT?
so frustrating.
tkjazzer@officeformac.com - 18 Apr 2008 18:53 GMT > Well what does mdimport return?? The exact same partial index that it did the first time I checked what was indexed.
> Obviously, there is a nasty spotlight issue on your Mac :-\ yup, very frustrating. don't know what to do about it.
> A conflict maybe?? How do I tell? It only has a problem partially indexing the text within .ppt files. Very few words get indexed.
Microsoft Word and PDF files index the entire thing.
> mdimport should be able to import the file. It imported the exact information that it already had - a partial index... not every word in ppt gets indexed - far from it.
> If it fails, then no need re-indexing the entire drive (since mdimport > would then do the same thing, on a much larger scale, simply failing > over and over again on each and every Office file). What is failure? I mean, it indexed something, but not everything?
And not every office file is the problem - just .ppt
> You might want to consider booting on your MacOS X DVD to run > DiskUtility from there to fix the drive, I have no idea how to do this
> You might also want to repair Permissions from Disk Utility (booting > from your own drive this time). I have no idea how to do this.
Thank you so much for your time and help,
Corentin Cras-Méneur - 18 Apr 2008 19:18 GMT It might not be related to your problem after all, but these two tips are worth knowing about
> > You might want to consider booting on your MacOS X DVD to run > > DiskUtility from there to fix the drive, > > I have no idea how to do this only repair a tiger drive with a Tiger DVD, a leopard drive with a leaopard dvd etc... put the DVD in the DVD drive, launch the System Preferences and change the startup options to boot from the DVD after the regular installationdialogs (select your language, etc), you shoudl see the menu bar appear. Of course, DON'T select to reinstall your system. All you need to do is to get to the first dialog with the menu bar, In the options in the menu bar, you can find disk utility. Select it to launch it. In the application, select your internal drive and hit "repair" Once you are done, go back to the menu bar to find the startup preferences. Reselect your internal drive and reboot on it.
> > You might also want to repair Permissions from Disk Utility (booting > > from your own drive this time). > > I have no idea how to do this. > > Thank you so much for your time and help, On your computer, launch the Disk Utility application (/Applications/Utilities). Select your hard drive on the left and click the Repair Permissions button.
These two trick can correct quite a few problems on your Mac. they are worth running every once in a while.
Corentin
 Signature --- Mac:MS MVP http://www.cortig.net/wordpress/ --- http://www.mvps.org - http://mvp.support.microsoft.com MVPs are not MS employees - Les MVP ne travaillent pas pour MS Remove "NoSpam" to e-mail me - Retirez "NoSpam" pour m'écrire
tkjazzer@officeformac.com - 18 Apr 2008 19:34 GMT Most of my ppt files are huge! (50 mb+)
however, I specifically chose this one because it was mainly ALL text, even though like 90+ slides of text.
This ppt file is only 5.9 mb.
So yes,
to clarify. EVERY word in WORD files are being indexed.
Only SOME words in PPT files are being indexed.
The same words in the PPT files were indexed before and after reindexing that file.
What should I do? tell microsoft? tell apple? ask a mac genius?
Corentin Cras-Méneur - 18 Apr 2008 22:20 GMT > to clarify. > EVERY word in WORD files are being indexed. [quoted text clipped - 8 lines] > tell apple? > ask a mac genius? I doubt the Mac Genius will be able to do anything for you since this looks like a limitation on the mdimporter. You can use the Send Feedback command in PPT to let MS know what's going on, but don;t expect a reply (though it will matter since it will let MS know there is a problem: you can't fix a bug if you don;t know it exists).
I tried to escalade the information to the contacts we have at MS and I hope it will get noticed...
Corentin
 Signature --- Mac:MS MVP http://www.cortig.net/wordpress/ --- http://www.mvps.org - http://mvp.support.microsoft.com MVPs are not MS employees - Les MVP ne travaillent pas pour MS Remove "NoSpam" to e-mail me - Retirez "NoSpam" pour m'écrire
tkjazzer@officeformac.com - 27 Apr 2008 05:22 GMT what should I do now?
Corentin Cras-Méneur - 28 Apr 2008 17:48 GMT > what should I do now? There is nothing you can do but wait for MS and Apple to provide a better indexing mechanism :-(
Corentin
 Signature --- Mac:MS MVP http://www.cortig.net/wordpress/ --- http://www.mvps.org - http://mvp.support.microsoft.com MVPs are not MS employees - Les MVP ne travaillent pas pour MS Remove "NoSpam" to e-mail me - Retirez "NoSpam" pour m'écrire
Corentin Cras-Méneur - 18 Apr 2008 19:18 GMT > ok, something did work. > [quoted text clipped - 5 lines] > SO WHY ARE ONLY SOME WORDS INSIDE THE PPT FILES BEING INDEXED WHILE > OTHERS ARE NOT? Well, as I was saying in another post, either the file is corrupted, or the mdimporter is buggy. It could also be that the mdimporter stops importing after a certain size in the file to avoid over-crowding the index (that would be sad... but it is a possibility) That's the only explanation that comes to my mind.
Reindexing could help, (since you saw some improvement here), but it doesn't look like it will properly entirely index your files.
Out of curiosity... Are your files big?? Do you start of with a bunch of graphics?? I'm really starting to wonder wehther the mdimporter could simply stop indexing after reaching some sort of pre-defined limit...
Corentin
 Signature --- Mac:MS MVP http://www.cortig.net/wordpress/ --- http://www.mvps.org - http://mvp.support.microsoft.com MVPs are not MS employees - Les MVP ne travaillent pas pour MS Remove "NoSpam" to e-mail me - Retirez "NoSpam" pour m'écrire
Corentin Cras-Méneur - 18 Apr 2008 19:18 GMT > so again, I can't seem to type any words I see within that file and have > them show up in the spotlight. > > However, I can type a word that appears in the file name and that shows > up. So if I'm getting this right: - words appearing in the index are fine, but not everything is indexed?? That's not a failure, it's more likely to be a bug in the importer (or a corruption in the file)
Corentin
 Signature --- Mac:MS MVP http://www.cortig.net/wordpress/ --- http://www.mvps.org - http://mvp.support.microsoft.com MVPs are not MS employees - Les MVP ne travaillent pas pour MS Remove "NoSpam" to e-mail me - Retirez "NoSpam" pour m'écrire
Corentin Cras-Méneur - 18 Apr 2008 19:18 GMT > Oh cool, terminal saved what I did yesterday: This is for a ppt file. It is properly indexed (including content)
Corentin
 Signature --- Mac:MS MVP http://www.cortig.net/wordpress/ --- http://www.mvps.org - http://mvp.support.microsoft.com MVPs are not MS employees - Les MVP ne travaillent pas pour MS Remove "NoSpam" to e-mail me - Retirez "NoSpam" pour m'écrire
Corentin Cras-Méneur - 18 Apr 2008 19:18 GMT > OK, what happen was I made this extremely long post to put on this > forum and the post got cut off and I lost like 3/4 of the post. Of course I > didn't think of copying the whole thing in case it happened. Hehehe, that's why I always use a dedicated newsreader for newsgroups instead of a Web Interface ;-)
> So once I need a break from studying I'll try to replicate more of the post. > > So Office Word and Acrobat Profession 8 is Indexing EVERYTHING fine. > > It is just ppt and pptx files. I just played around with a pptx file. I got: Corentin:~ corentin$ mdimport -n -d2 /Volumes/Gloubi/Users/me/Documents/Office\ Projects/Présentations/Beta\ cell\ regeneration\ review.pptx 2008-04-18 13:03:22.752 mdimport[20552:10b] Imported '/Volumes/Gloubi/Users/me/Documents/Office Projects/Présentations/Beta cell regeneration review.pptx' of type 'org.openxmlformats.presentationml.presentation' with no plugIn. 2008-04-18 13:03:22.755 mdimport[20552:10b] Attributes: { "_kMDItemFinderLabel" = <null>; "com_apple_metadata_modtime" = 223533394; kMDItemContentCreationDate = 2008-01-31 22:36:34 -0600; kMDItemContentModificationDate = 2008-01-31 22:36:34 -0600; kMDItemContentType = "org.openxmlformats.presentationml.presentation"; kMDItemContentTypeTree = ( "org.openxmlformats.presentationml.presentation", "org.openxmlformats.openxml", "public.zip-archive", "com.pkware.zip-archive", "public.data", "public.item", "com.apple.bom-archive", "public.archive", "public.presentation", "public.composite-content", "public.content" ); kMDItemDisplayName = { "" = "Beta cell regeneration review.pptx"; }; kMDItemKind = { "" = "Microsoft PowerPoint presentation"; }; }
As you can see, the file is indexed, but there isn't any content indexation. That's not a bug though, it's a limitation. The mdimporter doesn't index the content. I believe that even though the mdimporter is made by MS, it actually ships through Apple with the System Any improvement in this respect could only come through Apple.
Corentin
 Signature --- Mac:MS MVP http://www.cortig.net/wordpress/ --- http://www.mvps.org - http://mvp.support.microsoft.com MVPs are not MS employees - Les MVP ne travaillent pas pour MS Remove "NoSpam" to e-mail me - Retirez "NoSpam" pour m'écrire
Corentin Cras-Méneur - 11 Apr 2008 20:15 GMT > OK i keep catching mdimporter doing something in the activity monitor. > Can anyone tell me what it is doing? I appears to be working with > various .ppt files in the main .ppt folder. Could this be something? The log here doesn't tell me anything, It looks like the only way to find out more is to use the command line tools I previously mentioned. As I am away from home now, I can't really give you more details about them though but using the "man" command should provide you with plenty of details.
I don;t remember by heart how to use mdimport and mdutil, but the idea would be to: 1) check the list of mdimporter actually recognized by the system 2) use the command on a file to see what has been indexed for it 3) force-reindex a file or folder to see if it corrects the problem for the file or folder 4) force-reindex everything if 3 worked. It will run for some time....
Corentin
Out of memory, try:
mdimport -L This should list the mdimporter recognized on your system
 Signature --- Mac:MS MVP http://www.cortig.net/wordpress/ --- http://www.mvps.org - http://mvp.support.microsoft.com MVPs are not MS employees - Les MVP ne travaillent pas pour MS Remove "NoSpam" to e-mail me - Retirez "NoSpam" pour m'écrire
|
|
|