crushthelsatexam.com
robots.txt

Robots Exclusion Standard data for crushthelsatexam.com

Resource Scan

Scan Details

Site Domain crushthelsatexam.com
Base Domain crushthelsatexam.com
Scan Status Ok
Last Scan2025-03-11T07:55:01+00:00
Next Scan 2025-03-18T07:55:01+00:00

Last Scan

Scanned2025-03-11T07:55:01+00:00
URL https://crushthelsatexam.com/robots.txt
Domain IPs 104.21.83.63, 172.67.214.247, 2606:4700:3033::ac43:d6f7, 2606:4700:3036::6815:533f
Response IP 172.67.214.247
Found Yes
Hash 2a16913d6b907931d7485e2a8badfe15c33be7c1861c911ef7b6644c80a71988
SimHash c9008e500643

Groups

googlebot

Rule Path
Disallow /*?
Disallow /*?cc=*
Disallow /*?lang=*
Disallow /*?lang*

*

Rule Path
Disallow /wp-admin/
Disallow /*?
Disallow /*?cc=*
Disallow /*?lang=*
Disallow /*?lang*
Allow /wp-admin/admin-ajax.php

ia_archiver

Rule Path
Disallow /
Disallow /*?cc=*

archive.org_bot

Rule Path
Disallow /
Disallow /*?cc=*

googlebot-image

Rule Path
Disallow
Disallow /*?cc=*

*

Rule Path
Disallow /wp-content/uploads/wp-import-export-lite/
Disallow /*?cc=*

Other Records

Field Value
sitemap https://crushthelsatexam.com/sitemap_index.xml

Comments

  • WP Import Export Rule