filedesc.com
robots.txt
Robots Exclusion Standard data for filedesc.com
Resource Scan
Scan Details
Site Domain | filedesc.com |
Base Domain | filedesc.com |
Scan Status | Ok |
Last Scan | 2024-10-03T04:06:50+00:00 |
Next Scan | 2024-10-10T04:06:50+00:00 |
Last Scan
Scanned | 2024-10-03T04:06:50+00:00 |
URL | https://filedesc.com/robots.txt |
Domain IPs | 104.26.4.129, 104.26.5.129, 172.67.74.108, 2606:4700:20::681a:481, 2606:4700:20::681a:581, 2606:4700:20::ac43:4a6c |
Response IP | 104.26.5.129 |
Found | Yes |
Hash | f14c347cd832bd7e3e270ed5695f096e0fa7b62897ab966c7269984c3476ab9b |
SimHash | 119553582d90 |
Groups
*
Rule | Path |
---|---|
Disallow | /search |
Disallow | /autocomp/ |
Disallow | /*/search |
Disallow | /*/autocomp/ |
Disallow | /cdn-cgi/ |
ahrefsbot
amazonbot
anthropic-ai
applebot
awariobot
awariorssbot
barkrowler
blexbot
buck
ccbot
chatgpt-user
cohere-ai
dataforseobot
domainsproject.org
dotbot
ezoicbot
facebookbot
gptbot
grapeshot
imagesiftbot
linguee
mail.ru
mauibot
meta-externalagent
mj12bot
mtrobot
omgilibot
panscient.com
petalbot
proximic
scrapy
seekportbot
semrushbot
serpstatbot
verity
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://cdn.filedesc.com/sitemap.xml |