pdfdoku.com
robots.txt

Robots Exclusion Standard data for pdfdoku.com

Resource Scan

Scan Details

Site Domain pdfdoku.com
Base Domain pdfdoku.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-09-20T12:41:45+00:00
Next Scan 2025-12-19T12:41:45+00:00

Last Successful Scan

Scanned2022-10-14T07:25:13+00:00
URL https://pdfdoku.com/robots.txt
Response IP 104.21.74.68, 172.67.200.43
Found Yes
Hash 222607470ac4d5a13f9c03113e0314b7ba345ba40322ea2a4ad286a087357404
SimHash 283d4c10e513

Groups

*

Rule Path
Disallow /download/*
Disallow /cdn-cgi/

ia_archiver

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://pdfdoku.com/sitemap.xml