kaldata.com
robots.txt

Robots Exclusion Standard data for kaldata.com

Resource Scan

Scan Details

Site Domain kaldata.com
Base Domain kaldata.com
Scan Status Ok
Last Scan2024-05-27T02:30:32+00:00
Next Scan 2024-06-03T02:30:32+00:00

Last Scan

Scanned2024-05-27T02:30:32+00:00
URL https://kaldata.com/robots.txt
Redirect https://www.kaldata.com/robots.txt
Redirect Domain www.kaldata.com
Redirect Base kaldata.com
Domain IPs 104.26.6.17, 104.26.7.17, 172.67.71.13, 2606:4700:20::681a:611, 2606:4700:20::681a:711, 2606:4700:20::ac43:470d
Redirect IPs 104.26.6.17, 104.26.7.17, 172.67.71.13, 2606:4700:20::681a:611, 2606:4700:20::681a:711, 2606:4700:20::ac43:470d
Response IP 172.67.71.13
Found Yes
Hash 16e2d25f3fa7e23469fb2165ec03a4f5916ecd6e7dac4a170de54ee244e7eb51
SimHash d361a37d0e67

Groups

*

Rule Path
Allow /
Disallow /forums/startTopic/
Disallow /forums/discover/unread/
Disallow /forums/markallread/
Disallow /forums/staff/
Disallow /forums/cookie/
Disallow /forums/online/
Disallow /forums/discover/
Disallow /forums/leaderboard/
Disallow /forums/search/
Disallow /forums/tags/
Disallow /forums/*?advancedSearchForm=
Disallow /forums/register/
Disallow /forums/lostpassword/
Disallow /forums/login/
Disallow /forums/*?sortby=
Disallow /forums/*?filter=
Disallow /forums/*?tab=
Disallow /forums/*?do=
Disallow /forums/*ref%3D
Disallow /forums/*?forumId*
Disallow /forums/*?&controller=embed

rogerbot
mj12bot
exabot
dotbot
gigabot
blackwidow
chinaclaw
custo
disco
download\ demon
ecatch
eirgrabber
emailsiphon
emailwolf
express\ webpictures
extractorpro
eyenetie
flashget
getright
getweb!
go!zilla
go-ahead-got-it
grabnet
grafula
hmview
httrack
image\ stripper
image\ sucker
indy\ library
interget
internet\ ninja
jetcar
joc\ web\ spider
larbin
leechftp
mass\ downloader
midown\ tool
mister\ pix
navroad
nearsite
netants
netspider
net\ vampire
netzip
octopus
offline\ explorer
offline\ navigator
pagegrabber
papa\ foto
pavuk
pcbrowser
realdownload
reget
sitesnagger
smartdownload
superbot
superhttp
surfbot
takeout
teleport\ pro
voideye
web\ image\ collector
web\ sucker
webauto
webcopier
webfetch
webgo\ is
webleacher
webreaper
websauger
website\ extractor
website\ quester
webstripper
webwhacker
webzip
wget
widow
wwwoffle
xaldon\ webspider
zeus
magpie-crawler
google-extended
chatgpt-user
ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.kaldata.com/sitemap_index.xml
sitemap https://www.kaldata.com/forums/sitemap.php
sitemap https://www.kaldata.com/news-sitemap.xml

Warnings

  • `host` is not a known field.