ctpaha.media
robots.txt

Robots Exclusion Standard data for ctpaha.media

Resource Scan

Scan Details

Site Domain ctpaha.media
Base Domain ctpaha.media
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-07T09:15:03+00:00
Next Scan 2024-10-07T09:15:03+00:00

Last Successful Scan

Scanned2024-08-09T09:13:56+00:00
URL https://ctpaha.media/robots.txt
Domain IPs 104.21.17.156, 172.67.177.17, 2606:4700:3030::6815:119c, 2606:4700:3031::ac43:b111
Response IP 172.67.177.17
Found Yes
Hash b1eba8584217baa67d3da09e5a7bd8802b36bd3eb466e6b4644452a33e05cc51
SimHash 0425f874ee1a

Groups

*

Rule Path
Disallow /search
Disallow /sunsite/
Disallow /print/
Disallow /exec/
Disallow */page-*
Disallow */day%3D*
Allow /*.js?
Allow /*.css?
Allow /*.jpg?
Allow /*.jpeg?
Allow /*.gif?
Allow /*.png?

Other Records

Field Value
sitemap https://ctrana.news/sitemap.xml
sitemap https://ctrana.news/sitemap_latest.xml

Warnings

  • `host` is not a known field.