support.office.com
robots.txt

Robots Exclusion Standard data for support.office.com

Resource Scan

Scan Details

Site Domain support.office.com
Base Domain office.com
Scan Status Ok
Last Scan2024-08-27T12:39:50+00:00
Next Scan 2024-09-26T12:39:50+00:00

Last Scan

Scanned2024-08-27T12:39:50+00:00
URL https://support.office.com/robots.txt
Domain IPs 23.203.75.132, 2600:1413:b000:686::882, 2600:1413:b000:68c::882
Response IP 23.222.131.217
Found Yes
Hash 64e82b2f4717335fa3b43c589d2b8f7fdb94ac88f70529ad1bd7aff2e96cb0d7
SimHash 30923919edf0

Groups

*

Rule Path
Disallow */client/
Disallow */Client/
Disallow */community?
Disallow */Community?
Disallow */results.aspx
Disallow */Results.aspx
Disallow */results?
Disallow */Results?
Disallow /f1/
Disallow /F1/
Disallow /*/f1/
Disallow /*/F1/

baiduspider

Rule Path
Disallow */client/
Disallow */Client/
Disallow */community?
Disallow */Community?
Disallow */results.aspx
Disallow */Results.aspx
Disallow */results?
Disallow */Results?
Disallow /f1/
Disallow /F1/
Disallow /*/f1/
Disallow /*/F1/
Disallow */sitemap
Allow /zh-cn/sitemap
Disallow /bg-*/
Disallow /cs-*/
Disallow /da-*/
Disallow /el-*/
Disallow /et-*/
Disallow /he-*/
Disallow /hi-*/
Disallow /hr-*/
Disallow /hu-*/
Disallow /id-*/
Disallow /it-*/
Disallow /ja-*/
Disallow /kk-*/
Disallow /ko-*/
Disallow /lt-*/
Disallow /lv-*/
Disallow /ms-*/
Disallow /nb-*/
Disallow /pl-*/
Disallow /ro-*/
Disallow /ru-*/
Disallow /sk-*/
Disallow /sl-*/
Disallow /sr-latn-*/
Disallow /sv-*/
Disallow /th-*/
Disallow /tr-*/
Disallow /uk-*/
Disallow /vi-*/
Disallow /fi-*/
Disallow /ar-*/
Disallow /de-*/
Disallow /es-*/
Disallow /fr-*/
Disallow /nl-*/
Disallow /pt-*/
Disallow /en-*/
Allow /en-us/

Other Records

Field Value
sitemap https://support.office.com/sitemapcollection

Comments

  • Specify directives for all agents
  • Disallow all other SOC sitemaps except the one for Baidu
  • Disallow the crawl of locales other than zh-* and en-us