jp.reuters.com
robots.txt

Robots Exclusion Standard data for jp.reuters.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	jp.reuters.com
Base Domain	reuters.com
Scan Status	Ok
Last Scan	2024-06-27T02:09:01+00:00
Next Scan	2024-07-04T02:09:01+00:00

Last Scan

Scanned	2024-06-27T02:09:01+00:00
URL	https://jp.reuters.com/robots.txt
Domain IPs	104.88.70.33, 104.88.70.8, 2600:1413:b000:14::b857:c145, 2600:1413:b000:14::b857:c150
Response IP	125.56.219.75
Found	Yes
Hash	0e8e0fd257fcbb1d7b67edd156b1caf2bfecb04b7c218c8101ba1f11bb55a32c
SimHash	140a99408d92

Groups

piplbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

anthropic-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

/

claude-web

Rule	Path
Disallow	/

Rule

Path

Disallow

/

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

/

facebookbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

*

Rule	Path
Disallow	/site-search/
Disallow	/test/

Rule

Path

Disallow

/site-search/

Disallow

/test/

Back to top

Other Records

Field	Value
sitemap	https://jp.reuters.com/arc/outboundfeeds/sitemap-index/?outputType=xml
sitemap	https://jp.reuters.com/arc/outboundfeeds/news-sitemap-index/?outputType=xml
sitemap	https://jp.reuters.com/static/video-sitemap/jp/sitemap_video_index.xml

Field

Value

sitemap

https://jp.reuters.com/arc/outboundfeeds/sitemap-index/?outputType=xml

sitemap

https://jp.reuters.com/arc/outboundfeeds/news-sitemap-index/?outputType=xml

sitemap

https://jp.reuters.com/static/video-sitemap/jp/sitemap_video_index.xml

Back to top

Comments

robots.txt for www.jp.reuters.com

Back to top

jp.reuters.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

piplbot

ccbot

anthropic-ai

claude-web

google-extended

facebookbot

*

Other Records

Comments

jp.reuters.com
robots.txt