click.in
robots.txt

Robots Exclusion Standard data for click.in

Resource Scan

Scan Details

Site Domain click.in
Base Domain click.in
Scan Status Ok
Last Scan 2024-09-17T06:58:45+00:00
Next Scan 2024-09-24T06:58:45+00:00

Last Scan

Scanned 2024-09-17T06:58:45+00:00
URL https://click.in/robots.txt
Redirect https://www.click.in/robots.txt
Redirect Domain www.click.in
Redirect Base click.in
Domain IPs 104.18.32.230, 172.64.155.26, 2606:4700:4400::6812:20e6, 2606:4700:4400::ac40:9b1a
Redirect IPs 104.18.32.230, 172.64.155.26, 2606:4700:4400::6812:20e6, 2606:4700:4400::ac40:9b1a
Response IP 104.18.32.230
Found Yes
Hash 963b6dacf31292270986309e9e597b92af21f0b04cec4559a6b32c7180255a1b
SimHash 1338bc60af89
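
The 64-hex-digit Hash value is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm or say which bytes were hashed. As a minimal sketch under that assumption, a digest of the same shape could be computed like this (`content_hash` is a hypothetical helper name, not part of the scanner):

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Placeholder body for illustration only, not the real file content.
    sample = b"User-agent: *\nDisallow: /temp/\n"
    print(content_hash(sample))
```

Comparing such a digest between scans is one way to detect that the file changed. A SimHash, by contrast, is typically meant to stay similar when the content changes only slightly.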

Groups

*

Rule Path
Disallow /temp/
Disallow /classifieds/mylisting/
Disallow /xmldir/
Disallow /rss_files/
Disallow /rss/
Disallow /app_feed/
Disallow /xml_feed/
Disallow /*spam.html$
Disallow /*miscat.html$
Disallow /*classifieds/dashboard_search.php*
Disallow /*/hsearch/
Allow /
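
Several paths in the `*` group use the `*` wildcard and the `$` end anchor, which Python's `urllib.robotparser` does not interpret. The sketch below shows one way these rules could be evaluated, assuming the longest-match semantics described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie). The helper names `pattern_to_regex` and `is_allowed` are hypothetical, and the path is assumed to be already normalized.

```python
import re

# Rules copied from the "*" group above, as (directive, path pattern) pairs.
RULES = [
    ("disallow", "/temp/"),
    ("disallow", "/classifieds/mylisting/"),
    ("disallow", "/xmldir/"),
    ("disallow", "/rss_files/"),
    ("disallow", "/rss/"),
    ("disallow", "/app_feed/"),
    ("disallow", "/xml_feed/"),
    ("disallow", "/*spam.html$"),
    ("disallow", "/*miscat.html$"),
    ("disallow", "/*classifieds/dashboard_search.php*"),
    ("disallow", "/*/hsearch/"),
    ("allow", "/"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if `path` may be crawled under `rules`.

    Assumption: longest matching pattern wins; Allow wins ties;
    a path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


if __name__ == "__main__":
    for p in ["/", "/temp/a.txt", "/cars/spam.html", "/cars/spam.html?x=1"]:
        print(p, is_allowed(p))
```

Under these assumptions, `/cars/spam.html` is blocked by `/*spam.html$`, while `/cars/spam.html?x=1` is not, because the `$` anchor requires the path to end with `spam.html`.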

dwaarbot

Rule Path
Disallow /

seekbot

Rule Path
Disallow /

pete-spider light

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

openindexspider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

proximic

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

sputnikbot

Rule Path
Disallow /

riddler

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /
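
Every named group above carries a single `Disallow /`, so a crawler whose user agent matches one of those tokens is blocked site-wide, and all others fall back to the `*` group. A minimal sketch of that group selection follows. It assumes case-insensitive substring matching with the longest token winning (so `semrushbot-sa` is preferred over `semrushbot`); real crawlers may match product tokens more strictly. `select_group` and the sample user-agent strings are hypothetical.

```python
# Group tokens copied from the report above (all lowercase, as listed).
GROUP_TOKENS = [
    "dwaarbot", "seekbot", "pete-spider light", "dotbot", "twiceler",
    "baiduspider", "openindexspider", "ahrefsbot", "sistrix",
    "semrushbot", "semrushbot-sa", "proximic", "mj12bot",
    "sputnikbot", "riddler", "grapeshot",
]


def select_group(user_agent: str) -> str:
    """Return the group token that applies to `user_agent`.

    Assumption: a token matches if it appears anywhere in the
    lowercased user-agent string; the longest match wins.
    Falls back to the wildcard group "*".
    """
    ua = user_agent.lower()
    matches = [token for token in GROUP_TOKENS if token in ua]
    return max(matches, key=len) if matches else "*"


def is_blocked_sitewide(user_agent: str) -> bool:
    """Every named group in this file is 'Disallow /'."""
    return select_group(user_agent) != "*"


if __name__ == "__main__":
    # Illustrative user-agent strings, not taken from the report.
    for ua in ["SemrushBot-SA/1.0", "AhrefsBot/1.0", "ExampleCrawler/1.0"]:
        print(ua, select_group(ua), is_blocked_sitewide(ua))
```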

Other Records

Field Value
sitemap https://www.click.in/sitemap_index.xml
sitemap https://www.click.in/sitemap_latest_index.xml
sitemap https://www.click.in/sitemap_category_filter_index.xml
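
The sitemap records are independent of any user-agent group, so they can be collected with a simple line scan. The sketch below shows one way to do that. The `SAMPLE` text is a hand-written reconstruction for illustration, not the actual file, and `extract_sitemaps` is a hypothetical helper.

```python
def extract_sitemaps(robots_txt: str) -> list[str]:
    """Collect the values of all 'Sitemap:' records in a robots.txt body.

    Field names are matched case-insensitively. Text after '#' is treated
    as a comment, so a URL containing a '#' fragment would be truncated.
    """
    urls = []
    for raw_line in robots_txt.splitlines():
        line = raw_line.split("#", 1)[0].strip()
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls


# Hand-written sample in robots.txt syntax (an assumption about the layout).
SAMPLE = """\
User-agent: *
Disallow: /temp/
Allow: /

Sitemap: https://www.click.in/sitemap_index.xml
sitemap: https://www.click.in/sitemap_latest_index.xml
"""

if __name__ == "__main__":
    for url in extract_sitemaps(SAMPLE):
        print(url)
```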

Warnings

  • 2 invalid lines.