top10.digital
robots.txt

Robots Exclusion Standard data for top10.digital

Resource Scan

Scan Details

Site Domain top10.digital
Base Domain top10.digital
Scan Status Ok
Last Scan 2026-03-01T03:03:53+00:00
Next Scan 2026-03-08T03:03:53+00:00

Last Scan

Scanned 2026-03-01T03:03:53+00:00
URL https://top10.digital/robots.txt
Domain IPs 161.97.100.103
Response IP 161.97.100.103
Found Yes
Hash 766632836739adfb701791d79131ad354f54cdcf0cf686e0cdb10848be3b1bae
SimHash 6054cc924b83

Groups

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

msnbot

Rule Path
Disallow

slurp

Rule Path
Disallow

teoma

Rule Path
Disallow

gigabot

Rule Path
Disallow

robozilla

Rule Path
Disallow

nutch

Rule Path
Disallow

ia_archiver

Rule Path
Disallow

baiduspider

Rule Path
Disallow

naverbot

Rule Path
Disallow

yeti

Rule Path
Disallow

yahoo-mmcrawler

Rule Path
Disallow

psbot

Rule Path
Disallow

yahoo-blogs/v3.9

Rule Path
Disallow

*

Rule Path
Disallow /wp-admin/
Disallow /page/
Disallow /sort
Disallow /admin/
Disallow /logout
Disallow /node/add
Disallow /user/register
Disallow /user/password
Disallow /user/login

ahrefssiteaudit

Rule Path
Allow /

ahrefsbot

Rule Path
Allow /
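
As a rough illustration of how the groups above would be evaluated, the sketch below rebuilds a small robots.txt from a subset of the listed groups and checks it with Python's standard urllib.robotparser module. The file text is an assumed reconstruction: the scan does not show the raw file, and the one invalid line it reports is not reproduced here. The helper name allowed and the user agent ExampleBot are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of part of the scanned file. The raw robots.txt
# is not shown in the scan, so the exact spacing and ordering are guesses.
ROBOTS_TXT = """\
User-agent: googlebot
Disallow:

User-agent: *
Disallow: /wp-admin/
Disallow: /page/
Disallow: /sort
Disallow: /admin/
Disallow: /logout
Disallow: /node/add
Disallow: /user/register
Disallow: /user/password
Disallow: /user/login

User-agent: ahrefsbot
Allow: /
"""

BASE = "https://top10.digital"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` on the scanned site?"""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # An empty Disallow path places no restriction on that group.
    print(allowed("googlebot", "/wp-admin/"))   # True
    # Agents without their own group fall back to the "*" group.
    print(allowed("ExampleBot", "/wp-admin/"))  # False
    print(allowed("ExampleBot", "/"))           # True
    # Rules are prefix matches, so "/sort" also covers "/sorted".
    print(allowed("ExampleBot", "/sorted"))     # False
```

Under these assumptions, the sixteen groups with an empty Disallow path and the two Ahrefs groups with Allow / behave the same way: nothing is blocked for them, and only agents that fall through to the * group are restricted.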

Other Records

Field Value
sitemap https://top10.digital/sitemap_index.xml
sitemap https://top10.digital/news-sitemap.xml
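
The sitemap records above could be read back from a robots.txt with the same standard-library parser. The following is a minimal sketch, assuming Python 3.8 or later (where RobotFileParser.site_maps() is available) and assuming the two records appear in the file as ordinary Sitemap: lines; the variable names are illustrative only.

```python
from urllib.robotparser import RobotFileParser

# Assumed form of the two sitemap records as they would appear in the file.
SITEMAP_LINES = [
    "Sitemap: https://top10.digital/sitemap_index.xml",
    "Sitemap: https://top10.digital/news-sitemap.xml",
]

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_LINES)

# site_maps() returns None when no Sitemap lines were found.
sitemaps = sitemap_parser.site_maps() or []

if __name__ == "__main__":
    for url in sitemaps:
        print(url)
```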

Warnings

  • 1 invalid line.