harvardjsel.com
robots.txt

Robots Exclusion Standard data for harvardjsel.com

Resource Scan

Scan Details

Site Domain harvardjsel.com
Base Domain harvardjsel.com
Scan Status Ok
Last Scan 2026-01-17T02:58:01+00:00
Next Scan 2026-01-31T02:58:01+00:00

Last Scan

Scanned 2026-01-17T02:58:01+00:00
URL https://harvardjsel.com/robots.txt
Redirect https://xoilac86xt.tv/robots.txt
Redirect Domain xoilac86xt.tv
Redirect Base xoilac86xt.tv
Domain IPs 104.18.33.175, 172.64.154.81, 2606:4700:4408::6812:21af, 2a06:98c1:3104::ac40:9a51
Redirect IPs 104.18.8.102, 104.18.9.102, 2606:4700::6812:866, 2606:4700::6812:966
Response IP 104.18.8.102
Found Yes
Hash 4f52afa04d450bf2fe0e6a865c5baa27ce5a8ad0c581a60b1998bdbde145c896
SimHash 6b58d80049a2
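
The Redirect Domain and Hash fields above could be derived along the following lines. This is a minimal sketch, not the scanner's actual code: the function names redirect_domain and content_hash are hypothetical, and the use of SHA-256 is an assumption based only on the hash being 64 hexadecimal digits long. The report does not name the algorithm.

```python
import hashlib
from urllib.parse import urlsplit


def redirect_domain(requested_url: str, final_url: str):
    """Return the final host if it differs from the requested host, else None."""
    requested_host = urlsplit(requested_url).hostname
    final_host = urlsplit(final_url).hostname
    return final_host if final_host != requested_host else None


def content_hash(body: bytes) -> str:
    """Hex digest of the fetched robots.txt body (SHA-256 is an assumption)."""
    return hashlib.sha256(body).hexdigest()


# The URLs below are the ones listed in the scan report.
print(redirect_domain("https://harvardjsel.com/robots.txt",
                      "https://xoilac86xt.tv/robots.txt"))
```

A cross-domain redirect matters here because the rules and sitemap a crawler ends up reading are served by the redirect target rather than by the scanned domain.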

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /author/
Disallow /*/trackback
Disallow /tag/
Disallow /*/feed
Disallow /?s=*
Disallow /attachment/
Disallow /*?utm_source
Disallow /*%26utm_source
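
To illustrate how the rules in this group interact, here is a minimal sketch of a path matcher. It assumes the longest-match precedence described in RFC 9309, where `*` matches any run of characters, a trailing `$` anchors the end, and an Allow wins a tie. The names RULES, pattern_to_regex and is_allowed are hypothetical, and percent-encoding normalization is left out for brevity. Python's built-in urllib.robotparser is not used because it does not expand `*` wildcards.

```python
import re

# Rules of the "*" group, copied from the report above.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/author/"),
    ("disallow", "/*/trackback"),
    ("disallow", "/tag/"),
    ("disallow", "/*/feed"),
    ("disallow", "/?s=*"),
    ("disallow", "/attachment/"),
    ("disallow", "/*?utm_source"),
    ("disallow", "/*%26utm_source"),
]


def pattern_to_regex(pattern: str):
    """Translate a robots.txt path pattern into a compiled prefix regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Apply longest-match precedence; paths matching no rule are allowed."""
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            # A longer pattern wins; on equal length an Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


for path in ["/wp-admin/", "/wp-admin/admin-ajax.php", "/news/feed", "/"]:
    print(path, is_allowed(path))
```

Under these assumptions, /wp-admin/admin-ajax.php stays crawlable even though /wp-admin/ is disallowed, because the Allow pattern is the longer match.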

facebookexternalhit

Rule Path
Allow /
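
Because facebookexternalhit has its own group, a crawler identifying with that token would follow only this group and ignore the `*` rules. The sketch below shows one simple way group selection could work, assuming a case-insensitive exact match on the product token with `*` as the fallback. The function name select_group is hypothetical, and real crawlers may match tokens differently.

```python
# Group names as listed in the report above.
GROUP_NAMES = ["*", "facebookexternalhit"]


def select_group(user_agent: str, group_names=GROUP_NAMES):
    """Pick the group for a crawler: exact product-token match, else "*"."""
    # The product token is the part of the User-Agent string before "/".
    product = user_agent.split("/")[0].strip().lower()
    for name in group_names:
        if name != "*" and name.lower() == product:
            return name
    return "*" if "*" in group_names else None


print(select_group("facebookexternalhit/1.1"))
print(select_group("ExampleBot/2.0"))
```

With the two groups in this file, the Facebook scraper is allowed everywhere, while every other crawler falls back to the `*` group.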

Other Records

Field Value
sitemap https://xoilac86xt.tv/sitemap.xml

Comments

  • Allow Facebook scraper