filedesc.com
robots.txt

Robots Exclusion Standard data for filedesc.com

Resource Scan

Scan Details

Site Domain filedesc.com
Base Domain filedesc.com
Scan Status Ok
Last Scan2024-10-03T04:06:50+00:00
Next Scan 2024-10-10T04:06:50+00:00

Last Scan

Scanned2024-10-03T04:06:50+00:00
URL https://filedesc.com/robots.txt
Domain IPs 104.26.4.129, 104.26.5.129, 172.67.74.108, 2606:4700:20::681a:481, 2606:4700:20::681a:581, 2606:4700:20::ac43:4a6c
Response IP 104.26.5.129
Found Yes
Hash f14c347cd832bd7e3e270ed5695f096e0fa7b62897ab966c7269984c3476ab9b
SimHash 119553582d90

Groups

*

Rule Path
Disallow /search
Disallow /autocomp/
Disallow /*/search
Disallow /*/autocomp/
Disallow /cdn-cgi/

ahrefsbot
amazonbot
anthropic-ai
applebot
awariobot
awariorssbot
barkrowler
blexbot
buck
ccbot
chatgpt-user
cohere-ai
dataforseobot
domainsproject.org
dotbot
ezoicbot
facebookbot
gptbot
grapeshot
imagesiftbot
linguee
mail.ru
mauibot
meta-externalagent
mj12bot
mtrobot
omgilibot
panscient.com
petalbot
proximic
scrapy
seekportbot
semrushbot
serpstatbot
verity

Rule Path
Disallow /

Other Records

Field Value
sitemap https://cdn.filedesc.com/sitemap.xml