gendai.media
robots.txt

Robots Exclusion Standard data for gendai.media

Resource Scan

Scan Details

Site Domain gendai.media
Base Domain gendai.media
Scan Status Ok
Last Scan 2024-09-27T12:44:57+00:00
Next Scan 2024-10-04T12:44:57+00:00

Last Scan

Scanned 2024-09-27T12:44:57+00:00
URL https://gendai.media/robots.txt
Domain IPs 163.49.35.159
Response IP 163.49.35.159
Found Yes
Hash afaac48c36b1a47890b5fb664ea0808fbef561d99ef6457a8cca209c3069a59a
SimHash 4b1c9870c3d3

Groups

*

Rule Path
Disallow /search
Disallow /ud/pressrelease/
Disallow /list/author-book-list/
Disallow /articles/-/65274

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://gendai.media/sitemap.xml
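
The groups and records above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is reconstructed from the rules listed in this report, so its exact layout (casing of agent names, blank lines, ordering) is an assumption and not a copy of the file served at https://gendai.media/robots.txt. The helper name `build_parser`, the agent name `ExampleBot`, and the sample path `/articles/-/1` are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in the scan report.
# The real file's formatting and agent-name casing are assumptions.
ROBOTS_TXT = """\
User-agent: *
Disallow: /search
Disallow: /ud/pressrelease/
Disallow: /list/author-book-list/
Disallow: /articles/-/65274

User-agent: GPTBot
Disallow: /

User-agent: CCBot
Disallow: /

Sitemap: https://gendai.media/sitemap.xml
"""

BASE = "https://gendai.media"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text from a string, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# GPTBot and CCBot have their own groups that disallow the whole site.
gptbot_allowed = parser.can_fetch("GPTBot", BASE + "/articles/-/1")
ccbot_allowed = parser.can_fetch("CCBot", BASE + "/")

# Any other crawler (hypothetical "ExampleBot") falls back to the "*" group.
search_allowed = parser.can_fetch("ExampleBot", BASE + "/search")
article_allowed = parser.can_fetch("ExampleBot", BASE + "/articles/-/1")
blocked_article = parser.can_fetch("ExampleBot", BASE + "/articles/-/65274")

# Sitemap records are exposed separately (Python 3.8+).
sitemaps = parser.site_maps()

if __name__ == "__main__":
    print("GPTBot article:", gptbot_allowed)
    print("CCBot root:", ccbot_allowed)
    print("ExampleBot /search:", search_allowed)
    print("ExampleBot article:", article_allowed)
    print("ExampleBot /articles/-/65274:", blocked_article)
    print("Sitemaps:", sitemaps)
```

Note that `Disallow` paths are matched as prefixes, so `/search` also covers URLs such as `/search?q=...`. To check the live file instead of the reconstructed text, `RobotFileParser.set_url()` followed by `read()` can be used in place of `parse()`.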