klear.com
robots.txt

Robots Exclusion Standard data for klear.com

Resource Scan

Scan Details

Site Domain klear.com
Base Domain klear.com
Scan Status Ok
Last Scan2024-06-16T08:44:56+00:00
Next Scan 2024-06-30T08:44:56+00:00

Last Scan

Scanned2024-06-16T08:44:56+00:00
URL https://klear.com/robots.txt
Domain IPs 34.248.129.64, 52.214.102.187, 52.48.75.209
Response IP 52.214.102.187
Found Yes
Hash c6118547fa046500bb599e156dc52d523bc6def649fe60f3a0d05455029316a7
SimHash 4b0d41434294

Groups

*

Rule Path
Disallow /search.php?tweet_id=*
Disallow /autocomplete*
Disallow /search_instagram_network.php
Disallow /update/*
Disallow /ws/ws_more_search_results.php
Disallow /ws/*
Disallow /p/*
Disallow /signin/*
Disallow /api/*
Disallow /mobile/switch_to_mobile
Disallow /honey.php

mediapartners-google

Rule Path
Disallow

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 3

vegi bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 50

Other Records

Field Value
sitemap https://klear.com/smIndex.xml