mcpedl.org
robots.txt

Robots Exclusion Standard data for mcpedl.org

Resource Scan

Scan Details

Site Domain mcpedl.org
Base Domain mcpedl.org
Scan Status Ok
Last Scan 2024-09-20T16:05:54+00:00
Next Scan 2024-09-27T16:05:54+00:00

Last Scan

Scanned 2024-09-20T16:05:54+00:00
URL https://mcpedl.org/robots.txt
Domain IPs 104.21.19.193, 172.67.188.147, 2606:4700:3033::6815:13c1, 2606:4700:3034::ac43:bc93
Response IP 104.21.19.193
Found Yes
Hash ba84e4c8e243223fe036eb07fdafbd962fc92f06b1ac7c213e9aa1fb2e3e0ad0
SimHash 2590b3731db0

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /?
Disallow /wp-
Disallow *?s=
Disallow *%26s%3D
Disallow /search
Disallow *?attachment_id=
Disallow */feed
Disallow */rss
Disallow */embed
Disallow */tag
Disallow /wp/
Disallow /uploads_files/*
Allow */uploads
Allow /*/*.js
Allow /*/*.css
Allow /wp-*.png
Allow /wp-*.jpg
Allow /wp-*.jpeg
Allow /wp-*.gif
Allow /wp-*.svg
Allow /wp-*.pdf
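
To see how the rules in this group interact, the following Python sketch evaluates a path against them. It is not the scanner's own logic. It assumes the matching semantics described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. It also assumes that patterns beginning with `*` (rather than `/`) are treated as wildcards from the start of the path, which not every crawler does. The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative, and percent-encoding normalization is left out.

```python
import re

# Rules copied from the "*" group listed above.
RULES = [
    ("disallow", "/cgi-bin"),
    ("disallow", "/?"),
    ("disallow", "/wp-"),
    ("disallow", "*?s="),
    ("disallow", "*%26s%3D"),
    ("disallow", "/search"),
    ("disallow", "*?attachment_id="),
    ("disallow", "*/feed"),
    ("disallow", "*/rss"),
    ("disallow", "*/embed"),
    ("disallow", "*/tag"),
    ("disallow", "/wp/"),
    ("disallow", "/uploads_files/*"),
    ("allow", "*/uploads"),
    ("allow", "/*/*.js"),
    ("allow", "/*/*.css"),
    ("allow", "/wp-*.png"),
    ("allow", "/wp-*.jpg"),
    ("allow", "/wp-*.jpeg"),
    ("allow", "/wp-*.gif"),
    ("allow", "/wp-*.svg"),
    ("allow", "/wp-*.pdf"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be crawled under the given rules."""
    best = None  # (pattern length, is_allow); tuple order makes Allow win ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # A path that matches no rule is allowed by default.
    return True if best is None else best[1]


for sample in ("/wp-admin/", "/wp-content/uploads/2024/01/image.png", "/?s=test"):
    print(sample, is_allowed(sample))
```

Under these assumptions, `/wp-admin/` is blocked by `/wp-`, while an image under `/wp-content/uploads/` is permitted because the longer Allow patterns override the shorter `/wp-` Disallow.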

Other Records

Field Value
sitemap https://mcpedl.org/sitemap_index.xml

Warnings

  • `host` is not a known field.
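
A warning of this kind can be produced by comparing each field name in the file against a list of recognized fields. The Python sketch below shows one way to do that. The `KNOWN_FIELDS` set is an assumption (the scanner's actual list is not shown in this report), and the `SAMPLE` text is hypothetical, using `example.org` rather than the real file contents.

```python
# Assumed set of recognized fields: the RFC 9309 fields plus "sitemap".
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def unknown_fields(robots_txt):
    """Return field names, in order of first appearance, not in KNOWN_FIELDS."""
    found = []
    for line in robots_txt.splitlines():
        # Drop comments and surrounding whitespace.
        line = line.split("#", 1)[0].strip()
        if not line or ":" not in line:
            continue
        # Field names are case-insensitive; split only on the first colon
        # so that URLs in the value stay intact.
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in found:
            found.append(field)
    return found


# Hypothetical input, not the actual mcpedl.org file.
SAMPLE = """\
User-agent: *
Disallow: /cgi-bin
Host: example.org
Sitemap: https://example.org/sitemap_index.xml
"""

for name in unknown_fields(SAMPLE):
    print(f"`{name}` is not a known field.")
```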