We have been setting up SharePoint 2013 Search to crawl SharePoint 2010 content sources (SP2010 web apps in another farm). The configuration is done and we are able to crawl the SP2010 content sources. However, the crawl logs show many errors and warnings, and we found that only 30% of the overall data was crawled. One of the major issues in the logs that we are trying to resolve is the following:
Crawler:Content Plugin High CSSFeedersManager::session_CallbackReceived: Document processing returned a warning. Error messages: Error parsing document 'http://sp2010webapp/sites/Document Library/3.pptx'. There is no format handler able to parse documents of the format 'encoffmetro'.
The above error is shown for a .pptx file; however, the same error also occurs for other file formats (xlsx, docx, etc.).
I have no idea what 'encoffmetro' is. While searching for it, I came across the following post.
As mentioned there, I checked the UseIFilter setting for the various file types and found that it was set to true for almost all of them. I am not sure how it came to be true, since the default value should be false as per MSDN and we never modified this setting. Also, we do not have any third-party IFilters installed.
So, as suggested in the post, we set UseIFilter = False for the file types with errors, restarted the Search Host Service, reset the index, and ran a full crawl. However, this resulted in exactly the same errors. When we checked the UseIFilter setting for the changed file types afterwards, it still showed UseIFilter = True.
So the UseIFilter change is not being applied, and we are not sure whether it would resolve our issue even if it were.
Has anyone faced similar issues? Thanks in advance.
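To see which file types are affected and how often, a rough Python sketch like the one below could tally the warnings by file extension and reported format. It assumes the crawl log messages have been exported as plain text lines with the same wording as the warning quoted above; the function name `tally_unparsed` and the export step are hypothetical and not part of SharePoint.

```python
import re
from collections import Counter

# Pattern modelled on the wording of the warning quoted above (an assumption:
# other log messages may be worded differently and will simply be skipped).
PATTERN = re.compile(
    r"Error parsing document '(?P<url>[^']+)'\. "
    r"There is no format handler able to parse documents of the format '(?P<fmt>[^']+)'"
)


def tally_unparsed(lines):
    """Count 'no format handler' warnings per (file extension, reported format)."""
    counts = Counter()
    for line in lines:
        match = PATTERN.search(line)
        if not match:
            continue  # not a 'no format handler' warning
        # Take the extension from the last path segment of the document URL.
        name = match.group("url").rsplit("/", 1)[-1]
        ext = name.rsplit(".", 1)[-1].lower() if "." in name else ""
        counts[(ext, match.group("fmt"))] += 1
    return counts
```

Calling `tally_unparsed(open("crawl_warnings.txt"))` on such an export (the file name is only an example) would return a `Counter` keyed by pairs such as `("pptx", "encoffmetro")`.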
- Edited by Devendra Singh 23 hours 16 minutes ago