SharePoint 2013: Crawl Scaling Recommendations
<html> <body> <p>In SharePoint 2013 Search, crawling, filtering, and indexing are no longer tied to a single component (i.e. the Crawler in SharePoint 2010). The Crawl Component in 2013 is only responsible for downloading documents ("gathering") and feeding them to the Content Processing Component(s). <br> <br> Because the filtering ("content processing") and indexing tasks are offloaded, the crawler is no longer I/O or CPU intensive and does not need to be scaled beyond a couple of components (for fault tolerance and network throughput to content sources). With 2x1Gbit/s connections, the content farms are more likely to be the bottleneck than the crawler itself. Host distribution rules are also gone (see <a href="http://blogs.msdn.com/b/sharepoint_strategery/archive/2013/06/30/why-host-distribution-rules-dont-apply-to-sharepoint-2013.aspx"> http://blogs.msdn.com/b/sharepoint_strategery/archive/2013/06/30/why-host-distribution-rules-dont-apply-to-sharepoint-2013.aspx</a>), and with the new search architecture they are no longer needed. Likewise, with the architectural changes, Crawl DBs are now added only to accommodate content volume, not to improve crawl performance.</p> <p>This will sound strange to those coming from a SharePoint Search background (e.g. <a href="http://blogs.msdn.com/b/russmax/archive/2010/04/16/search-2010-architecture-and-scale-part-1-crawl.aspx"> http://blogs.msdn.com/b/russmax/archive/2010/04/16/search-2010-architecture-and-scale-part-1-crawl.aspx</a>), but familiar to those coming from FAST Search for SharePoint 2010 (FS4SP). As in FS4SP, crawl performance is scaled primarily by increasing the number of Content Processing Components (CPCs; analogous to procservers in FS4SP, with the contentdistributor and indexingdispatcher functionality rolled in). 
The CPCs also scale on their own, based on CPU availability, up to a limit (the default supports up to 12 cores; see <a href="http://technet.microsoft.com/en-us/library/cc262787.aspx#Search">http://technet.microsoft.com/en-us/library/cc262787.aspx#Search</a>). <br> For most SharePoint content, a CPC will process 5-10 items per second per core. For example, on an 8-core server hosting an Admin Component, a Crawl Component, and a Content Processing Component (with ~6 cores to itself), you might see a crawl rate of ~45 items per second (6 cores at an average of 7.5 items per second each), assuming no content source or index bottleneck.</p> <p>The 2013 Index Component builds the shadow index, performs merging, and propagates the index journal to replicas (other Index Components in the same Partition). The I/O considerations for this component are therefore closer to those of FS4SP than to SharePoint 2010 indexing, since there is no longer a property store DB and all indexes are built and stored locally by the Index Component. During small crawls/shadow index builds, the Index Component issues small writes (~256 KB) sustained at a rate of 100 IOPS. For handling queries, the component issues small reads (~64 KB), at about 30 IOPS for each query per second (QPS) served. To support 10 QPS at low latency, for example, the storage subsystem would need to be capable of 300 IOPS (10 x 30) for 64 KB reads. <br> <br> If a crawl changes more than 10% of the currently indexed items, a master merge is triggered, leading to large reads and writes by the indexer (~100 MB per operation), which can cause the performance of both small writes (shadow index) and small reads (queries) to drop. 
For this reason, the documented recommendation of a separate disk (<a href="http://technet.microsoft.com/en-us/library/jj219628.aspx">http://technet.microsoft.com/en-us/library/jj219628.aspx</a>) is really more of a requirement for production environments, to ensure that performance remains acceptable even during a master merge.</p> <p>To summarize, crawl performance in SharePoint 2013 depends on:</p> <ol> <li><strong>Response time</strong> from content sources and network bandwidth to content sources </li><li><strong>CPU resources</strong> for the Content Processing Component </li><li><strong>I/O resources</strong> for the Index Component </li></ol> <br> The I/O requirements described above are lower than in FAST Search for SharePoint 2010. The new minimums are as follows: <ul> <li><span style="font-family:Calibri">256 KB write – 100 IOPS [shadow index]</span> </li><li><span style="font-family:Calibri">64 KB read – 300 IOPS [10 queries per second]</span> </li><li><span style="font-family:Calibri">100 MB read – 200 MB/s [master merge]</span> </li><li><span style="font-family:Calibri">100 MB write – 200 MB/s [master merge]</span> </li></ul> <p><span style="font-family:Calibri"><br> The TechNet documentation on this topic for the 2010 release can be found here: <a href="http://technet.microsoft.com/en-us/library/gg604775(v=office.14).aspx"> http://technet.microsoft.com/en-us/library/gg604775(v=office.14).aspx</a><br> </span></p> </body> </html>
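The sizing arithmetic in the article (items per second per core, IOPS per query per second, and the 10% master merge threshold) can be sketched as a few helper functions. This is only a rough back-of-the-envelope aid, not a Microsoft tool: the function names and parameters are hypothetical, and the default values are simply the figures quoted in the article, which assume no content source or index bottleneck.

```python
# Rough capacity estimates using the figures quoted in the article.
# All function names and parameters here are illustrative assumptions.

def estimated_crawl_rate(cpc_cores, items_per_sec_per_core=7.5):
    """Approximate crawl rate (items/s) for cores available to the CPC.

    The article quotes 5-10 items per second per core; 7.5 is the midpoint.
    """
    return cpc_cores * items_per_sec_per_core


def required_read_iops(queries_per_second, iops_per_qps=30):
    """Approximate 64 KB read IOPS the index storage must sustain."""
    return queries_per_second * iops_per_qps


def master_merge_expected(changed_items, indexed_items, threshold=0.10):
    """True if the changed fraction exceeds the quoted 10% threshold."""
    return changed_items / indexed_items > threshold


if __name__ == "__main__":
    # 8-core server with ~6 cores left for the Content Processing Component
    print(estimated_crawl_rate(6))
    # 10 queries per second at low latency
    print(required_read_iops(10))
    # 200,000 changed items out of 1,000,000 indexed
    print(master_merge_expected(200_000, 1_000_000))
```

Plugging in the article's own example (6 cores, 10 QPS) reproduces its ~45 items per second and 300 IOPS figures; real numbers depend on content type and hardware.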