learndesk.us
robots.txt
Robots Exclusion Standard data for learndesk.us
Resource Scan
Scan Details
Site Domain | learndesk.us |
Base Domain | learndesk.us |
Scan Status | Ok |
Last Scan | 2024-09-19T17:03:11+00:00 |
Next Scan | 2024-10-19T17:03:11+00:00 |
Last Scan
Scanned | 2024-09-19T17:03:11+00:00 |
URL | https://www.learndesk.us/robots.txt |
Domain IPs | 96.17.96.16, 96.17.96.28 |
Response IP | 23.44.4.171 |
Found | Yes |
Hash | 0d92b73ef7bc4c30697129a3c981c2bae2078b6cd79e5022ba619a789e0f3621 |
SimHash | 0b9f2d87df95 |
Groups
googlebot
bingbot
slurp
msnbot
mediapartners-google*
googlebot-image
yahoo-mmcrawler
ia_archiver
naverbot
yeti
yandexbot
yandexdirect
yandexdirectdyn
yandexmedia
yandeximages
yadirectfetcher
yandexpagechecker
Rule | Path |
---|---|
Disallow | /p/ |
Disallow | */write_review/* |
Disallow | *writeareview$ |
Disallow | *news-updates$ |
Disallow | *news-updates/amp$ |
Disallow | *where-to-buy$ |
Disallow | */questions$ |
Disallow | */questions/amp$ |
Disallow | */substitutes$ |
Disallow | */substitutes/amp$ |
Disallow | */reports$ |
Disallow | *chunked* |
Disallow | *expert-content* |
Disallow | /redirect* |