thomsonreuters.com
robots.txt

Robots Exclusion Standard data for thomsonreuters.com

Resource Scan

Scan Details

Site Domain thomsonreuters.com
Base Domain thomsonreuters.com
Scan Status Ok
Last Scan2024-10-30T21:05:29+00:00
Next Scan 2024-11-29T21:05:29+00:00

Last Scan

Scanned2024-10-30T21:05:29+00:00
URL https://thomsonreuters.com/robots.txt
Redirect https://www.thomsonreuters.com/robots.txt
Redirect Domain www.thomsonreuters.com
Redirect Base thomsonreuters.com
Domain IPs 155.46.172.255
Redirect IPs 2600:9000:2721:1400:1b:b66f:bac0:93a1, 2600:9000:2721:2400:1b:b66f:bac0:93a1, 2600:9000:2721:3600:1b:b66f:bac0:93a1, 2600:9000:2721:4200:1b:b66f:bac0:93a1, 2600:9000:2721:5a00:1b:b66f:bac0:93a1, 2600:9000:2721:6e00:1b:b66f:bac0:93a1, 2600:9000:2721:c00:1b:b66f:bac0:93a1, 2600:9000:2721:ce00:1b:b66f:bac0:93a1, 3.165.102.48, 3.165.102.7, 3.165.102.70, 3.165.102.85
Response IP 3.165.102.48
Found Yes
Hash ee317a2361ab243927842bbe9be0adc1cb1b35b63b9a439545222c0b91b7529a
SimHash a10183ca7f56

Groups

*

Rule Path
Disallow /*?*
Allow /*js*
Allow /*css*
Disallow /404.html
Disallow /500.html
Disallow /en/60279341234.html
Disallow /en/60279341234
Disallow /content/ewp-marketing-websites
Disallow /content/ciam-user-profile
Disallow /*/profile/email-verification

Other Records

Field Value
sitemap https://www.thomsonreuters.com/en/sitemap.xml
sitemap https://www.thomsonreuters.com/en-us/posts/sitemap_index.xml

Comments

  • Global robots config
  • robots.txt for http://thomsonreuters.com/