thomsonreuters.com
robots.txt

Robots Exclusion Standard data for thomsonreuters.com

Resource Scan

Scan Details

Site Domain thomsonreuters.com
Base Domain thomsonreuters.com
Scan Status Ok
Last Scan2024-05-03T21:03:35+00:00
Next Scan 2024-06-02T21:03:35+00:00

Last Scan

Scanned2024-05-03T21:03:35+00:00
URL https://thomsonreuters.com/robots.txt
Redirect https://www.thomsonreuters.com/robots.txt
Redirect Domain www.thomsonreuters.com
Redirect Base thomsonreuters.com
Domain IPs 155.46.172.255
Redirect IPs 13.226.120.118, 13.226.120.36, 13.226.120.84, 13.226.120.94, 2600:9000:229f:1c00:1b:b66f:bac0:93a1, 2600:9000:229f:3200:1b:b66f:bac0:93a1, 2600:9000:229f:4c00:1b:b66f:bac0:93a1, 2600:9000:229f:6c00:1b:b66f:bac0:93a1, 2600:9000:229f:7e00:1b:b66f:bac0:93a1, 2600:9000:229f:9e00:1b:b66f:bac0:93a1, 2600:9000:229f:aa00:1b:b66f:bac0:93a1, 2600:9000:229f:c200:1b:b66f:bac0:93a1
Response IP 13.33.30.36
Found Yes
Hash ee317a2361ab243927842bbe9be0adc1cb1b35b63b9a439545222c0b91b7529a
SimHash a10183ca7f56

Groups

*

Rule Path
Disallow /*?*
Allow /*js*
Allow /*css*
Disallow /404.html
Disallow /500.html
Disallow /en/60279341234.html
Disallow /en/60279341234
Disallow /content/ewp-marketing-websites
Disallow /content/ciam-user-profile
Disallow /*/profile/email-verification

Other Records

Field Value
sitemap https://www.thomsonreuters.com/en/sitemap.xml
sitemap https://www.thomsonreuters.com/en-us/posts/sitemap_index.xml

Comments

  • Global robots config
  • robots.txt for http://thomsonreuters.com/