Geeks With Blogs

News invisible hit counter View My Stats

Technical Speaking

 

Background:

At Brain Station-23, One of my customer requirement was to configure PDF Content Search so that they can search PDF by Text using search web part. After doing some googling, I found that PDF content can be search through Adobe PDF iFilter 9. I found a complete Guideline at http://support.microsoft.com/kb/2293357 

The similar instructions also given at Code Project with some screen shots step by step which is very much helpful to understand

http://www.codeproject.com/Articles/82546/How-to-configure-PDF-iFilter-for-SharePoint-Server

For Testing purpose I have configured this in my VM (Standalone –Single Farm) and it works fine as expected.

What is the Problem?

The Problem arises when I tried to configure it in Live Environment (Multi-Farm). My Production Servers were consist of 
2 Front End Server (SP 2010 Server Farm 1 & SP 2010 Server Farm 2) running under NBL with 2 SQL2008 R2 Storage (Clustered)

I didn’t find the exact solution anywhere in the Internet even after posting in the forum. But Thanks to all who tried to give me the solution and I got some clues from their answer. You can see my Final answer at http://social.msdn.microsoft.com/Forums/en-US/sharepointgeneralprevious/thread/2d4470ef-fd0a-4e12-8886-68837865dfba as solution. Let me Explain here all Together Once again to make your life easier.

Solution(Single Farm):

There is not much complex for Standalone Server or Single Farm. You can just follow the Instruction Straight Forward as written as http://support.microsoft.com/kb/2293357.

OR

My suggestion is to use “Powershell Script for configuring PDF ifilter for Sharepoint 2010” which is much easier. If you are not sure and confident with manually configuration as above. You can use the power shell script for out of box automatically configuration from configure PDF Icon to complete PDF iFilter installation.

Download the powershell script here https://docs.google.com/open?id=0B55FfAMp1BXdMDRhNzQyZjktM2Y5NS00MjY4LTkwYjUtY2QwOWUxOTQ3MzY0

Reference:

http://nhutcmos.wordpress.com/2011/11/04/full-text-search-pdf-content-in-sharepoint-2010/

Solution(Multi Farm):

Here, You can also use the same power shell script for out of box automatically configuration from configure PDF Icon to complete PDF iFilter installation. Download the PowerShell script here https://docs.google.com/open?id=0B55FfAMp1BXdMDRhNzQyZjktM2Y5NS00MjY4LTkwYjUtY2QwOWUxOTQ3MzY0. Thanks to him who did this great job for us. If you use PowerShell script then skip (Already in PowerShell Script) part form the following steps (manual) since some of them already covered in the script.

But if you want to do manually then follow the steps as I written below:

1. Configuring (Step 2 to Step 12) Adobe PDF iFilter 9 for 64-bit platforms http://support.microsoft.com/kb/2293357 except Re-Start Computer

(Already in PowerShell Script)

2. Verify that the following registry subkeys are present and that they contain the appropriate values. If they are not present, you can manually create them

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Shared Tools\Web Server Extensions\14.0\Search\Setup\ContentIndexCommon\Filters\Extension\

Verify .pdf subkey exists or not, if it doesn’t right click on Extensions and create a Subkey

Add the following values to this key:

<REG_SZ> Default =<value not set>
< REG_SZ> Extension = .pdf
< REG_DWORD> FileTypeBucket = 1
< REG_SZ> MimeTypes = application/pdf

(Already in PowerShell Script)

2013-05-06_1914

3. It is recommended that you set your system PATH environment variable to the "bin" folder of the "iFilter" installation. For example, from the "Control Panel\System\Advanced\Environment Variables" tab, append to PATH "C:\Program Files\Adobe\Adobe PDF iFilter 9 for 64-bit platforms\bin\"
and then restart the computer. Reference: http://www.adobe.com/support/downloads/detail.jsp?ftpID=4025

(Already in PowerShell Script)

Ref: http://www.adobe.com/support/downloads/detail.jsp?ftpID=4025

4. Open Windows Explorer and navigate to the following location:
C:\Program Files\Adobe\Adobe PDF iFilter 9 for 64-bit platforms\bin

Add your Search Service Application’s Content Access Account to the list of users and groups who have security access to this folder.  Make sure the account has Read, Read & Execute, and List Folder Contents permissions

Ref: http://www.dontpapanic.com/blog/?p=224

5. Re-register the adobe iFilter dll using PowerShell

regsvr32.exe"C:\Program Files\Adobe\Adobe PDF iFilter 9 for 64-bit platforms\bin\PDFFilter.dll"

(Already in PowerShell Script)

6. Restart the SharePoint Server Search 14 in Every Farm (Enable/Run if  Server Search 14 is Disable in other Server Farm, In my case Search Server was disable in Farm 2)

(Already in PowerShell Script)

2013-05-06_1910

7. Reboot the SharePoint servers in Farm

8. Reset the content index (Optional)

2013-05-06_1908

9. Start Full Crawling

2013-05-06_1919

10. Search pdf Content should find the right PDF as same as below.

2013-05-06_1839

Cheers!!

 

All References:

1) MS Forum discussion at http://social.msdn.microsoft.com/Forums/en-US/sharepointgeneralprevious/thread/2d4470ef-fd0a-4e12-8886-68837865dfba
where i provided the final answer

2) http://nhutcmos.wordpress.com/2011/11/04/full-text-search-pdf-content-in-sharepoint-2010/

3) http://blogs.msdn.com/b/sameersurve/archive/2012/05/15/sharepoint-2010-cannot-crawl-pdf-files.aspx

4) http://profadmins.com/2012/03/20/adobe-pdf-ifilter-indexing-with-sharepoint-2010/

5) http://geekswithblogs.net/leonardm/archive/2010/09/10/configuring-sharepoint-2010-to-return-pdf-files-in-search-results.aspx

Posted on Monday, May 6, 2013 3:48 AM | Back to top


Comments on this post: SharePoint 2010 Search - PDF content search configuration in multi-farm Environment

No comments posted yet.
Your comment:
 (will show your gravatar)


Copyright © MJ Ferdous | Powered by: GeeksWithBlogs.net