internexa.com
robots.txt

Robots Exclusion Standard data for internexa.com

Resource Scan

Scan Details

Site Domain internexa.com
Base Domain internexa.com
Scan Status Ok
Last Scan 2024-11-07T17:05:35+00:00
Next Scan 2024-12-07T17:05:35+00:00

Last Scan

Scanned 2024-11-07T17:05:35+00:00
URL https://www.internexa.com/robots.txt
Domain IPs 199.60.103.226, 199.60.103.30, 2606:2c40::c73c:671e, 2606:2c40::c73c:67e2
Response IP 199.60.103.226
Found Yes
Hash bfb2ec61cda5ee9a3218cb8be18d63d4c5c94305c15802a4c52ed5dc11b1db99
SimHash 3af58c60c6b9

Groups

*

Rule Path
Disallow /sample-*
Disallow /blog/sample*
Disallow /tag/*
Disallow /page/*
Disallow /author/*
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*
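
The rules above use the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. As a rough illustration of how a crawler might evaluate them, the sketch below translates each pattern into a regular expression and tests a path against the group. The function names `pattern_to_regex` and `is_disallowed` are hypothetical, and the sketch assumes RFC 9309-style matching (prefix match, `*` matches any run of characters, a trailing `$` anchors the end). Percent-encoding normalization is omitted. Because the group contains no Allow rules, any match is treated as disallowed.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW = [
    "/sample-*",
    "/blog/sample*",
    "/tag/*",
    "/page/*",
    "/author/*",
    "/_hcms/preview/",
    "/hs/manage-preferences/",
    "/hs/preferences-center/",
    "/*?*hs_preview=*",
    "/*?*hsCacheBuster=*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (hypothetical helper).

    Assumes '*' matches any run of characters and a trailing '$'
    anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if the path (including any query string) matches a rule."""
    # re.match anchors at the start, which gives prefix-match semantics.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    for candidate in ["/tag/fiber", "/blog/real-post", "/blog/post?hs_preview=abc"]:
        print(candidate, is_disallowed(candidate))
```

Under these assumptions, `/tag/fiber` and `/blog/post?hs_preview=abc` would be blocked, while `/blog/real-post` would remain crawlable.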

Other Records

Field Value
sitemap https://blog.internexa.com/sitemap.xml