internetsearchinc.com
robots.txt

Robots Exclusion Standard data for internetsearchinc.com

Resource Scan

Scan Details

Site Domain internetsearchinc.com
Base Domain internetsearchinc.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-09-27T08:38:35+00:00
Next Scan 2025-10-04T08:38:35+00:00

Last Successful Scan

Scanned2025-08-27T06:18:35+00:00
URL https://internetsearchinc.com/robots.txt
Domain IPs 104.21.91.216, 172.67.180.195, 2606:4700:3033::ac43:b4c3, 2606:4700:3036::6815:5bd8
Response IP 172.67.180.195
Found Yes
Hash 27ea91231116362226f0a19203effe4796d376cb915c6c1df86dc28b2ac12c3b
SimHash 6b88d9c1cab1

Groups

*
gptbot
chatgpt-user
ccbot
anthropic-ai
google-extended
facebookbot
claude-web
cohere-ai
perplexitybot
applebot-extended
adsbot-google
adsbot-google-mobile
adsbot-google-mobile-apps
googlebot
*

Rule Path
Allow /
Disallow /feed/
Disallow */*/feed
Disallow /nogooglebot/
Disallow /dev/
Disallow */*/feed
Disallow *?share=
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /?s*
Disallow /*feed*
Disallow /wp-includes/
Disallow /cart/
Disallow /*add-to-cart%3D*
Disallow /checkout/
Disallow /my-account/
Disallow /wp-content/plugins/
Disallow /readme.html
Disallow /refer/
Allow /wp-content/uploads/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap https://hawaiitourstravel.com/sitemap_index.xml
sitemap https://hawaiitourstravel.com/sitemap/

Comments

  • Default robots file