newsdig.tbs.co.jp
robots.txt

Robots Exclusion Standard data for newsdig.tbs.co.jp

Resource Scan

Scan Details

Site Domain newsdig.tbs.co.jp
Base Domain tbs.co.jp
Scan Status Ok
Last Scan 2024-04-28T13:52:08+00:00
Next Scan 2024-05-05T13:52:08+00:00

Last Scan

Scanned 2024-04-28T13:52:08+00:00
URL https://newsdig.tbs.co.jp/robots.txt
Domain IPs 163.49.35.137
Response IP 163.49.35.137
Found Yes
Hash 835fb4f88beacc587021271ffb03cc9b34dac9eb5ee30294a076e9ddcf74e210
SimHash b83cd811e133
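
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file, although the scan does not state which algorithm was used. A minimal sketch of how such a fingerprint could be computed, assuming SHA-256 over the raw response bytes; `content_hash` and the sample body are hypothetical and are not the scanner's actual code or the site's actual file:

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt response body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file served by the site.
sample = b"User-agent: *\nDisallow: /common/xml/\n"
digest = content_hash(sample)
print(len(digest))  # a SHA-256 hex digest is always 64 characters
```

Comparing the digest between two scans is a cheap way to tell whether the file changed at all; a byte-for-byte identical file always yields the same digest.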

Groups

*

Rule Path
Disallow /list/prtimes/
Disallow /common/senkyo/
Disallow /common/xml/
Disallow /list/search?*html$
Disallow /list/search?fulltext=*%E3%80%90
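
The last two rules use the `*` wildcard and the `$` end-of-path anchor (`%E3%80%90` is the percent-encoded form of the character 【). A minimal sketch of how a crawler might evaluate such patterns, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the URL; `rule_matches` is a hypothetical helper, and the example paths are made up for illustration:

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Check whether a robots.txt path pattern matches a URL path plus query."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each '*' match any run of characters.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start, so unanchored rules act as prefix matches.
    return re.match(regex, path) is not None


print(rule_matches("/list/search?*html$", "/list/search?q=news.html"))
print(rule_matches("/list/search?*html$", "/list/search?q=news.html&page=2"))
print(rule_matches("/list/search?fulltext=*%E3%80%90", "/list/search?fulltext=abc%E3%80%90def"))
```

Under these assumptions the first and third calls match, while the second does not because the `$` anchor requires the URL to end in `html`.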

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /
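
The groups above can be checked with Python's standard `urllib.robotparser`. The sketch below rebuilds a robots.txt from the listed rules rather than from the live file, so the exact text is an assumption. The two wildcard rules are left out because `urllib.robotparser` does plain prefix matching, and `ExampleBot` and the `/articles/1` path are hypothetical:

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan's groups; not a copy of the served file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /list/prtimes/
Disallow: /common/senkyo/
Disallow: /common/xml/

User-agent: chatgpt-user
Disallow: /

User-agent: gptbot
Disallow: /

User-agent: ccbot
Disallow: /

User-agent: google-extended
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

base = "https://newsdig.tbs.co.jp"
# The named AI crawlers are blocked from the whole site.
print(parser.can_fetch("GPTBot", base + "/articles/1"))
# Any other agent falls back to the '*' group and its path rules.
print(parser.can_fetch("ExampleBot", base + "/articles/1"))
print(parser.can_fetch("ExampleBot", base + "/list/prtimes/1"))
```

In this reconstruction the four named agents are denied everything, while every other agent is only kept out of the three listed directories.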

Other Records

Field Value
sitemap https://newsdig.tbs.co.jp/sitemap.xml