javiercallejo.net
robots.txt

Robots Exclusion Standard data for javiercallejo.net

Resource Scan

Scan Details

Site Domain javiercallejo.net
Base Domain javiercallejo.net
Scan Status Ok
Last Scan 2025-03-07T05:50:10+00:00
Next Scan 2025-03-14T05:50:10+00:00

Last Scan

Scanned 2025-03-07T05:50:10+00:00
URL https://javiercallejo.net/robots.txt
Domain IPs 104.21.52.184, 172.67.202.216, 2606:4700:3030::6815:34b8, 2606:4700:3032::ac43:cad8
Response IP 104.21.52.184
Found Yes
Hash cf2cf21c7384dd94d11f4c0647f2daca21b0da5b2b6deb34be623c7c5a5cad66
SimHash 6e2a18808fb5

Groups

*

Rule      Path                      Comment
Disallow  /wp-admin/                -
Disallow  /wp-login.php             block access to admin section
Disallow  /comments/feed            -
Disallow  /feed/                    -
Disallow  /search/                  block access to internal search result pages
Disallow  *?s=*                     block access to internal search result pages
Disallow  *?p=*                     block access to pages for which permalinks fails
Disallow  *%26p%3D*                 block access to pages for which permalinks fails
Disallow  /tag/                     block access to tag pages
Disallow  /author/                  block access to author pages
Disallow  /404-error/               block access to 404 page
Allow     /wp-admin/admin-ajax.php  -
Disallow  /page/                    -
Disallow  /2/                       -
Disallow  /3/                       -
Disallow  /4/                       -
Disallow  /5/                       -
Disallow  /6/                       -
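
The rules above mix plain path prefixes, `*` wildcards, and one Allow that overlaps a Disallow. A minimal sketch of how a crawler following RFC 9309 might evaluate them is shown here: the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers written for this illustration, and only a subset of the listed rules is included. The matching is hand-rolled because Python's standard `urllib.robotparser` applies rules in file order and does not expand `*`.

```python
import re

# A subset of the "*" group rules from the table above, as (kind, pattern) pairs.
RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/search/"),
    ("disallow", "*?s=*"),
    ("disallow", "/tag/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/page/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt pattern: '*' matches any run, a trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under longest-match precedence."""
    best_len, best_allow = -1, True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            # Longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for path in ["/wp-admin/admin-ajax.php", "/wp-admin/options.php",
                 "/?s=robots", "/tag/python/", "/blog/hello-world/"]:
        print(path, is_allowed(path))
```

Under this sketch `/wp-admin/admin-ajax.php` stays fetchable even though `/wp-admin/` is disallowed, because the Allow pattern is the longer match. A parser that stops at the first matching rule in file order would reach the opposite conclusion for that path.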

wget
curl
msiecrawler
webcopier
httrack
microsoft.url.control
libwww
muieblackcat
duggmirror
slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://javiercallejo.net/sitemap_index.xml
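
The crawl-delay and sitemap records can be read with Python's standard `urllib.robotparser`. The sketch below parses a small hand-written excerpt rather than the live file: `ROBOTS_LINES` is an assumed reconstruction from the report above (three of the named agents sharing one group with only a crawl-delay, plus two of the `*` rules), not the verbatim robots.txt. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumed excerpt, reconstructed from the report; the real file may differ in layout.
ROBOTS_LINES = """\
User-agent: *
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php

User-agent: wget
User-agent: curl
User-agent: httrack
Crawl-delay: 10

Sitemap: https://javiercallejo.net/sitemap_index.xml
""".splitlines()

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# Agents in the named group get the 10-second delay; other agents have none set.
wget_delay = parser.crawl_delay("wget")
other_delay = parser.crawl_delay("Googlebot")

# The named group defines no rules, so it does not inherit the "*" Disallow lines.
wget_admin = parser.can_fetch("wget", "https://javiercallejo.net/wp-admin/")
other_admin = parser.can_fetch("Googlebot", "https://javiercallejo.net/wp-admin/")

sitemaps = parser.site_maps()

if __name__ == "__main__":
    print(wget_delay, other_delay, wget_admin, other_admin, sitemaps)
```

This also illustrates what "No rules defined. All paths allowed." means in practice: a crawler that matches the named group uses only that group, so the Disallow lines under `*` do not apply to it.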