sitesimilar.net
robots.txt

Robots Exclusion Standard data for sitesimilar.net

Resource Scan

Scan Details

Site Domain sitesimilar.net
Base Domain sitesimilar.net
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-03-13T03:55:54+00:00
Next Scan 2024-06-11T03:55:54+00:00

Last Successful Scan

Scanned 2022-09-28T07:06:15+00:00
URL https://sitesimilar.net/robots.txt
Redirect https://www.sitesimilar.net/robots.txt
Redirect Domain www.sitesimilar.net
Redirect Base sitesimilar.net
Response IP 172.67.72.153, 104.26.13.79, 104.26.12.79
Found Yes
Hash c09d3bf7217bb1d306f40945ea4c5657a26c442473d5f2f99890d0be9f6e6176
SimHash 593dd1c0bb9a
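
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming SHA-256 over the raw response body, a fingerprint of the same shape could be computed as in the sketch below. The function name and the sample bytes are hypothetical, and the sample is not the actual file content, so its digest will not match the value listed here.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


# Placeholder bytes for illustration only; not the real robots.txt body.
sample_body = b"User-agent: *\nDisallow: /tag/\n"
digest = sha256_hex(sample_body)
print(len(digest), digest)
```

Comparing digests from two scans is a cheap way to tell whether the file changed byte for byte; the SimHash field presumably serves the looser purpose of detecting near-duplicates.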

Groups

*

Rule Path
Disallow /cdn-cgi/
Disallow /r/
Disallow /sites/
Disallow /tag/

yandex

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

vagabondo

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

special_archiver

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

special_archiver

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

queryseekerspider

Rule Path
Disallow /

proximic

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

adsbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

neevabot

Rule Path
Disallow /
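
The groups above can be checked programmatically. The sketch below uses Python's standard `urllib.robotparser` on a reconstructed excerpt of the rules. The excerpt text is an assumption rebuilt from the table (the original file is not reproduced in this report and contained 2 invalid lines), and `ExampleCrawler` is a hypothetical user agent that matches no named group.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed excerpt of the groups listed above; an assumption,
# not the verbatim file served by the site.
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /cdn-cgi/
Disallow: /r/
Disallow: /sites/
Disallow: /tag/

User-agent: ahrefsbot
Disallow: /

User-agent: semrushbot
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_EXCERPT)
BASE = "https://www.sitesimilar.net"

# A crawler with its own "Disallow /" group is blocked everywhere.
blocked_bot = parser.can_fetch("AhrefsBot", BASE + "/")
# A crawler with no named group falls back to the "*" group.
generic_tag = parser.can_fetch("ExampleCrawler", BASE + "/tag/example")
generic_root = parser.can_fetch("ExampleCrawler", BASE + "/")

print(blocked_bot, generic_tag, generic_root)
```

Under these assumptions, the named bots are refused every path, while other crawlers are refused only the four prefixes in the `*` group.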

Other Records

Field Value
sitemap https://www.sitesimilar.net/sitemap.xml

Warnings

  • 2 invalid lines.