ciro.pe
robots.txt

Robots Exclusion Standard data for ciro.pe

Resource Scan

Scan Details

Site Domain ciro.pe
Base Domain ciro.pe
Scan Status Ok
Last Scan2025-12-22T10:46:28+00:00
Next Scan 2026-01-21T10:46:28+00:00

Last Scan

Scanned2025-12-22T10:46:28+00:00
URL https://ciro.pe/robots.txt
Domain IPs 104.21.5.118, 172.67.133.97, 2606:4700:3030::6815:576, 2606:4700:3031::ac43:8561
Response IP 104.21.5.118
Found Yes
Hash a4c9c9522b1ced7af762d9caa947d8ecbcf4e5967f9d3c461ca53e2adc324706
SimHash eaa81898e2a8

Groups

*

Rule Path Comment
Disallow /wp-admin/ -
Allow /wp-admin/admin-ajax.php -
Disallow *?s=* block access to internal search result pages
Disallow /search/ block access to internal search result pages
Disallow */feed/ -
Disallow /cgi-bin/ -
Disallow /wp-login.php -
Disallow /wp-register.php -
Disallow /xmlrpc.php -
Disallow /wp-includes/ -
Disallow /wp-content/plugins/ -
Disallow /wp-content/themes/ -
Disallow /trackback/ -
Disallow /feed/ -
Disallow /*/feed/ -
Disallow /*? -
Disallow /readme.html -
Allow /wp-content/uploads/ -
Allow /wp-admin/admin-ajax.php -
Allow /wp-includes/js/ -

googlebot

Rule Path
Allow /*.js$
Allow /*.css$
Disallow /*.php$
Disallow /wp-
Disallow /?s=

bingbot

Rule Path
Allow /*.js$
Allow /*.css$
Disallow /*.php$
Disallow /wp-
Disallow /?s=
Disallow /*blackhole
Disallow /?blackhole

*

Rule Path
Disallow /wp-content/uploads/wpo-plugins-tables-list.json

Other Records

Field Value
sitemap http://ciroweb-wordpress-3e4d10-34-42-234-76.traefik.me/sitemap_index.xml
sitemap http://ciroweb-wordpress-3e4d10-34-42-234-76.traefik.me/page-sitemap.xml

Comments

  • Block specific URLs
  • Disallow: /unimportant-page/
  • Disallow: /example/