languagetechnology.pangeanic.com
robots.txt

Robots Exclusion Standard data for languagetechnology.pangeanic.com

Resource Scan

Scan Details

Site Domain languagetechnology.pangeanic.com
Base Domain pangeanic.com
Scan Status Ok
Last Scan 2024-09-06T04:29:44+00:00
Next Scan 2024-10-06T04:29:44+00:00

Last Scan

Scanned 2024-09-06T04:29:44+00:00
URL https://languagetechnology.pangeanic.com/robots.txt
Redirect https://pangeanic.com/robots.txt
Redirect Domain pangeanic.com
Redirect Base pangeanic.com
Domain IPs 199.60.103.226, 199.60.103.30, 2606:2c40::c73c:671e, 2606:2c40::c73c:67e2
Redirect IPs 199.60.103.102, 199.60.103.2
Response IP 199.60.103.2
Found Yes
Hash d68c028861e37f0e57b69242f35cd55131f78e119f1524904d662828ed1ed7e3
SimHash 3871c4f1c7f1

Groups

openai

Rule Path
Disallow /
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*

*

Rule Path
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*
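
The two groups above can be checked against sample paths with a small matcher. The following is a minimal sketch, not the scanner's own logic. It assumes the common wildcard convention in which `*` matches any run of characters and a trailing `$` anchors the end of the path, and it assumes a crawler falls back to the `*` group when no group carries its name. The names `GROUPS`, `pattern_to_regex`, and `is_disallowed` are hypothetical, and the exact-name group lookup is a simplification of real user-agent matching.

```python
import re

# Rules copied from the scan above, keyed by user-agent group.
COMMON_RULES = [
    "/_hcms/preview/",
    "/hs/manage-preferences/",
    "/hs/preferences-center/",
    "/*?*hs_preview=*",
    "/*?*hsCacheBuster=*",
]
GROUPS = {
    "openai": ["/"] + COMMON_RULES,
    "*": COMMON_RULES,
}


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (assumed semantics)."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each '*' match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(agent: str, path: str) -> bool:
    """Return True if any Disallow rule in the agent's group matches the path.

    Simplification: the group is chosen by exact lowercase name, falling
    back to '*'. Both groups hold only Disallow rules, so a single match
    is enough and no Allow/Disallow precedence needs resolving.
    """
    rules = GROUPS.get(agent.lower(), GROUPS["*"])
    # re.match anchors at the start of the path, as prefix matching requires.
    return any(pattern_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for agent, path in [
        ("openai", "/blog/"),
        ("*", "/blog/"),
        ("*", "/blog/?hs_preview=abc"),
        ("examplebot", "/hs/manage-preferences/"),
    ]:
        print(agent, path, is_disallowed(agent, path))
```

Under these assumptions the `openai` group is blocked from every path by its `Disallow /` rule, while all other agents are blocked only from the preview and preferences paths and from URLs carrying the `hs_preview` or `hsCacheBuster` query parameters.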