/.well-known/

Log In Sign Up

doujindesu.xxx
robots.txt

Robots Exclusion Standard data for doujindesu.xxx

Archived Snapshots

Resource Scan

Scan Details

Site Domain	doujindesu.xxx
Base Domain	doujindesu.xxx
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2025-02-01T22:17:00+00:00
Next Scan	2025-05-02T22:17:00+00:00

Last Successful Scan

Scanned	2024-03-16T12:58:54+00:00
URL	https://doujindesu.xxx/robots.txt
Redirect	https://doujindesu.tv/robots.txt
Redirect Domain	doujindesu.tv
Redirect Base	doujindesu.tv
Domain IPs	104.21.39.166, 172.67.146.197, 2606:4700:3030::6815:27a6, 2606:4700:3032::ac43:92c5
Redirect IPs	104.26.8.62, 104.26.9.62, 172.67.75.187, 2606:4700:20::681a:83e, 2606:4700:20::681a:93e, 2606:4700:20::ac43:4bbb
Response IP	104.26.9.62
Found	Yes
Hash	47135392598f6164d91093b52adc774d288e4e4f359f8f54b219bc5254c93c1d
SimHash	b810bd48c574

Groups

*

Rule

Path

Disallow

/includes/

Disallow

/themes/

Disallow

/search/

Disallow

/memeen/

Allow

/themes/front/*/css/

Allow

/themes/front/*/images/

Allow

/themes/front/*/js/

Allow

/themes/front/*/fonts/

Allow

/themes/front/*/images/*.jpg

Allow

/themes/front/*/images/*.png

Allow

/themes/front/*/images/*.gif

Back to top

Other Records

Field

Value

sitemap

https://doujindesu.tv/sitemap.xml

Back to top

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/wc/robots.html
For syntax checking, see:
http://www.sxw.org.uk/computing/robots/check.html
Disallow directories
Disallow paths
Allow themes
Allow content images

Back to top