ittoluca.edu.mx
robots.txt

Robots Exclusion Standard data for ittoluca.edu.mx

Resource Scan

Scan Details

Site Domain ittoluca.edu.mx
Base Domain ittoluca.edu.mx
Scan Status Ok
Last Scan2025-12-25T05:32:03+00:00
Next Scan 2026-01-24T05:32:03+00:00

Last Scan

Scanned2025-12-25T05:32:03+00:00
URL https://ittoluca.edu.mx/robots.txt
Redirect https://www.tolucatecnm.mx/robots.txt
Redirect Domain www.tolucatecnm.mx
Redirect Base tolucatecnm.mx
Domain IPs 74.208.22.48
Redirect IPs 104.21.32.33, 172.67.182.152, 2606:4700:3031::ac43:b698, 2606:4700:3033::6815:2021
Response IP 172.67.182.152
Found Yes
Hash 659a0894f892fd54e37907d0bb0e5153238573235366fe9d21d8031a093aafea
SimHash 4a3fdc20ba92

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-login.php
Disallow /wp-admin/
Disallow /wordpress/
Disallow /login/
Disallow /login
Disallow /ajax/
Disallow /api/
Disallow /blank.html

Other Records

Field Value
crawl-delay 10

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

twengabot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

zumbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.tolucatecnm.mx/sitemap.xml

Warnings

  • 2 invalid lines.