pdfroom.com
robots.txt

Robots Exclusion Standard data for pdfroom.com

Resource Scan

Scan Details

Site Domain pdfroom.com
Base Domain pdfroom.com
Scan Status Ok
Last Scan 2024-10-26T15:53:33+00:00
Next Scan 2024-11-02T15:53:33+00:00

Last Scan

Scanned 2024-10-26T15:53:33+00:00
URL https://pdfroom.com/robots.txt
Domain IPs 104.21.48.223, 172.67.188.59, 2606:4700:3033::ac43:bc3b, 2606:4700:3035::6815:30df
Response IP 104.21.48.223
Found Yes
Hash c6ef991bb37d4a81234fcdca3e7fbb4ab563234a688bd2ab79da05ad9abfa334
SimHash 591075464b56

Groups

*

Rule Path
Disallow /books/*/*/download$
Disallow /preview/books/*
Disallow /embed/books/*
Disallow /cdn-cgi/
Disallow /*/preview/books/*
Disallow /*/embed/books/*
Disallow /*/books/*/*/download$
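
The rules above rely on two pattern features: `*` as a wildcard and a trailing `$` as an end-of-path anchor. The following is a minimal Python sketch of how a crawler might evaluate them, assuming the common interpretation in which `*` matches any run of characters (including `/`) and a pattern without `$` is a prefix match. The helper names `pattern_to_regex` and `is_disallowed` and the sample paths are hypothetical and are not taken from the scan.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW = [
    "/books/*/*/download$",
    "/preview/books/*",
    "/embed/books/*",
    "/cdn-cgi/",
    "/*/preview/books/*",
    "/*/embed/books/*",
    "/*/books/*/*/download$",
]


def pattern_to_regex(pattern):
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the match at the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(path):
    """Return True if any Disallow pattern matches from the start of path."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    # Hypothetical paths, used only to exercise the rules.
    for sample in [
        "/books/some-title/abc123/download",
        "/books/some-title/abc123",
        "/en/preview/books/abc123",
        "/cdn-cgi/trace",
    ]:
        print(sample, is_disallowed(sample))
```

Under these assumptions, a download URL is blocked only when the path ends exactly in `/download`, while the book page itself stays crawlable. The `/*/...` variants repeat the same rules for paths that carry one extra leading segment, such as a language prefix.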