aftershock.news
robots.txt

Robots Exclusion Standard data for aftershock.news

Resource Scan

Scan Details

Site Domain aftershock.news
Base Domain aftershock.news
Scan Status Ok
Last Scan2024-06-07T21:11:28+00:00
Next Scan 2024-06-14T21:11:28+00:00

Last Scan

Scanned2024-06-07T21:11:28+00:00
URL https://aftershock.news/robots.txt
Domain IPs 178.208.71.17
Response IP 178.208.71.17
Found Yes
Hash be9d9cad8e5ecf2618fa5779ff770d0d351e482c082a81d92bcfc232cdd93179
SimHash 6ffad663c7b4

Groups

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

*

Rule Path
Disallow /*
Disallow /?q=blog%2F*
Disallow /?q=blogs&*
Disallow /?q=blog&*
Disallow /?q=front&*
Disallow /?q=all&*
Disallow /?q=printing%2F*%2F
Disallow /?q=printing%2F*&
Disallow /?q=user%2F*%2F
Disallow /?q=user%2F*%2F
Disallow /?q=node%2F*%2F
Disallow /?q=node%2F*%2F
Disallow /?q=comment%2F
Allow /$
Allow /?q=front
Allow /?q=blog
Allow /?q=all
Allow /?q=printing%2F
Allow /?q=node%2F
Allow /?q=node%2F*
Allow /?q=user%2F
Allow /?q=user%2F*
Allow /?q=sitemap.xml*
Allow /sites/default/files/
Allow /journal/yandex_rss/all/
Allow /zen_feed
Allow /?q=zen_feed
Allow /sites/default/files/

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://aftershock.news/?q=sitemap.xml

Warnings

  • `host` is not a known field.