tr.com
robots.txt

Robots Exclusion Standard data for tr.com

Resource Scan

Scan Details

Site Domain tr.com
Base Domain tr.com
Scan Status Ok
Last Scan2024-06-12T10:58:12+00:00
Next Scan 2024-07-12T10:58:12+00:00

Last Scan

Scanned2024-06-12T10:58:12+00:00
URL https://tr.com/robots.txt
Redirect https://www.thomsonreuters.com/robots.txt
Redirect Domain www.thomsonreuters.com
Redirect Base thomsonreuters.com
Domain IPs 155.46.172.255
Redirect IPs 2600:9000:2721:2400:1b:b66f:bac0:93a1, 2600:9000:2721:5800:1b:b66f:bac0:93a1, 2600:9000:2721:7400:1b:b66f:bac0:93a1, 2600:9000:2721:7c00:1b:b66f:bac0:93a1, 2600:9000:2721:8a00:1b:b66f:bac0:93a1, 2600:9000:2721:a00:1b:b66f:bac0:93a1, 2600:9000:2721:ca00:1b:b66f:bac0:93a1, 2600:9000:2721:fe00:1b:b66f:bac0:93a1, 3.165.102.48, 3.165.102.7, 3.165.102.70, 3.165.102.85
Response IP 3.165.102.85
Found Yes
Hash ee317a2361ab243927842bbe9be0adc1cb1b35b63b9a439545222c0b91b7529a
SimHash a10183ca7f56

Groups

*

Rule Path
Disallow /*?*
Allow /*js*
Allow /*css*
Disallow /404.html
Disallow /500.html
Disallow /en/60279341234.html
Disallow /en/60279341234
Disallow /content/ewp-marketing-websites
Disallow /content/ciam-user-profile
Disallow /*/profile/email-verification

Other Records

Field Value
sitemap https://www.thomsonreuters.com/en/sitemap.xml
sitemap https://www.thomsonreuters.com/en-us/posts/sitemap_index.xml

Comments

  • Global robots config
  • robots.txt for http://thomsonreuters.com/