schoolandcollegelistings.com
robots.txt

Robots Exclusion Standard data for schoolandcollegelistings.com

Resource Scan

Scan Details

Site Domain schoolandcollegelistings.com
Base Domain schoolandcollegelistings.com
Scan Status Ok
Last Scan2024-11-09T18:12:11+00:00
Next Scan 2024-11-16T18:12:11+00:00

Last Scan

Scanned2024-11-09T18:12:11+00:00
URL https://schoolandcollegelistings.com/robots.txt
Redirect https://www.schoolandcollegelistings.com/robots.txt
Redirect Domain www.schoolandcollegelistings.com
Redirect Base schoolandcollegelistings.com
Domain IPs 104.21.19.210, 172.67.190.52, 2606:4700:3034::6815:13d2, 2606:4700:3037::ac43:be34
Redirect IPs 104.21.19.210, 172.67.190.52, 2606:4700:3034::6815:13d2, 2606:4700:3037::ac43:be34
Response IP 104.21.19.210
Found Yes
Hash 6c53218190b823c5f7fceed8f29acfe116c4afc7826395b9ddb05cf80f0a9cc9
SimHash d82d45416221

Groups

*

Rule Path
Disallow /town/
Disallow /towns/
Disallow /country/
Disallow /countries/
Disallow /search/
Disallow /directors/
Disallow /director/
Disallow /actors/
Disallow /actor/
Disallow /genres/
Disallow /genre/
Disallow /producer/
Disallow /screenplaywriter/
Disallow /studio/
Disallow /writer/
Disallow /vicinitysearch
Disallow /*/cities$
Disallow /*/cities/
Disallow /*/showImage%28
Disallow /pv/
Disallow /rf/
Disallow /login/
Disallow /about/
Disallow /contact/
Disallow /cdn-cgi/

semrushbot-sa
semrushbot
blexbot
mj12bot
ahrefsbot
dotbot
megaindex.ru
megaindex.com
mauibot
mauibot (crawler.feedback+wc@gmail.com)
the knowledge ai

Rule Path
Disallow /