gendai.ismedia.jp
robots.txt

Robots Exclusion Standard data for gendai.ismedia.jp

Resource Scan

Scan Details

Site Domain gendai.ismedia.jp
Base Domain ismedia.jp
Scan Status Ok
Last Scan2024-10-29T18:58:25+00:00
Next Scan 2024-11-28T18:58:25+00:00

Last Scan

Scanned2024-10-29T18:58:25+00:00
URL https://gendai.ismedia.jp/robots.txt
Redirect https://gendai.media/robots.txt
Redirect Domain gendai.media
Redirect Base gendai.media
Domain IPs 210.148.177.138
Redirect IPs 163.49.35.159
Response IP 163.49.35.159
Found Yes
Hash 45bfe114daa7bf6a4bc945a489f4283e89ea4a898a6b251ec70451a18cebd7c8
SimHash 731e8151c2d3

Groups

*

Rule Path
Disallow /search
Disallow /ud/pressrelease/
Disallow /list/author-book-list/
Disallow /articles/-/65274

ccbot
gptbot
applebot-extended
google-extended
meta-externalagent
claudebot
anthropic-ai
claude-web
cohere-ai
omgili
timpibot
webzio-extended
bytespider
icc-crawler

Rule Path
Disallow /

Other Records

Field Value
sitemap https://gendai.media/sitemap.xml