support.office.com
robots.txt

Robots Exclusion Standard data for support.office.com

Resource Scan

Scan Details

Site Domain support.office.com
Base Domain office.com
Scan Status Ok
Last Scan2024-05-29T12:38:41+00:00
Next Scan 2024-06-28T12:38:41+00:00

Last Scan

Scanned2024-05-29T12:38:41+00:00
URL https://support.office.com/robots.txt
Domain IPs 23.222.131.217, 2600:1417:3f:b88::882, 2600:1417:3f:ba5::882
Response IP 184.28.158.251
Found Yes
Hash 64e82b2f4717335fa3b43c589d2b8f7fdb94ac88f70529ad1bd7aff2e96cb0d7
SimHash 30923919edf0

Groups

*

Rule Path
Disallow */client/
Disallow */Client/
Disallow */community?
Disallow */Community?
Disallow */results.aspx
Disallow */Results.aspx
Disallow */results?
Disallow */Results?
Disallow /f1/
Disallow /F1/
Disallow /*/f1/
Disallow /*/F1/

baiduspider

Rule Path
Disallow */client/
Disallow */Client/
Disallow */community?
Disallow */Community?
Disallow */results.aspx
Disallow */Results.aspx
Disallow */results?
Disallow */Results?
Disallow /f1/
Disallow /F1/
Disallow /*/f1/
Disallow /*/F1/
Disallow */sitemap
Allow /zh-cn/sitemap
Disallow /bg-*/
Disallow /cs-*/
Disallow /da-*/
Disallow /el-*/
Disallow /et-*/
Disallow /he-*/
Disallow /hi-*/
Disallow /hr-*/
Disallow /hu-*/
Disallow /id-*/
Disallow /it-*/
Disallow /ja-*/
Disallow /kk-*/
Disallow /ko-*/
Disallow /lt-*/
Disallow /lv-*/
Disallow /ms-*/
Disallow /nb-*/
Disallow /pl-*/
Disallow /ro-*/
Disallow /ru-*/
Disallow /sk-*/
Disallow /sl-*/
Disallow /sr-latn-*/
Disallow /sv-*/
Disallow /th-*/
Disallow /tr-*/
Disallow /uk-*/
Disallow /vi-*/
Disallow /fi-*/
Disallow /ar-*/
Disallow /de-*/
Disallow /es-*/
Disallow /fr-*/
Disallow /nl-*/
Disallow /pt-*/
Disallow /en-*/
Allow /en-us/

Other Records

Field Value
sitemap https://support.office.com/sitemapcollection

Comments

  • Specify directives for all agents
  • Disallow all other SOC sitemaps except the one for Baidu
  • Disallow the crawl of locales other than zh-* and en-us