pdf-archive.com
robots.txt

Robots Exclusion Standard data for pdf-archive.com

Resource Scan

Scan Details

Site Domain pdf-archive.com
Base Domain pdf-archive.com
Scan Status Ok
Last Scan 2024-10-04T15:51:42+00:00
Next Scan 2024-10-11T15:51:42+00:00

Last Scan

Scanned 2024-10-04T15:51:42+00:00
URL https://pdf-archive.com/robots.txt
Redirect https://www.pdf-archive.com/robots.txt
Redirect Domain www.pdf-archive.com
Redirect Base pdf-archive.com
Domain IPs 13.37.6.18
Redirect IPs 51.158.54.25
Response IP 51.158.54.25
Found Yes
Hash ce2b0a4823895a9318e6f1da62242da5e487e0124b3cb6008e0f0c7149cc896c
SimHash eb041840c853

Groups

gptbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

sirdatabot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

yacybot

Rule Path
Disallow /

*

Rule Path
Disallow (empty path: nothing is disallowed)

Other Records

Field Value
sitemap https://www.pdf-archive.com/sitemap.xml
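Example

The groups and sitemap record listed in this report can be turned back into robots.txt text and checked with Python's standard urllib.robotparser module. This is a minimal sketch, not the site's actual file: the text is rebuilt from the report fields rather than fetched, so the real file's ordering, comments, and letter case may differ. The names BLOCKED_AGENTS, build_robots_txt, and make_parser are hypothetical helpers introduced for illustration, and site_maps() requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# User-agent groups that the report lists with "Disallow /".
BLOCKED_AGENTS = ["gptbot", "turnitinbot", "sirdatabot", "bytespider", "yacybot"]

# Sitemap value from the "Other Records" table.
SITEMAP_URL = "https://www.pdf-archive.com/sitemap.xml"


def build_robots_txt() -> str:
    """Rebuild robots.txt text from the report (an approximation of the real file)."""
    lines = []
    for agent in BLOCKED_AGENTS:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    # The "*" group has an empty Disallow value, which blocks nothing.
    lines += ["User-agent: *", "Disallow:", ""]
    lines.append(f"Sitemap: {SITEMAP_URL}")
    return "\n".join(lines)


def make_parser() -> RobotFileParser:
    """Parse the rebuilt text locally, without any network request."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt().splitlines())
    return parser


if __name__ == "__main__":
    parser = make_parser()
    page = "https://www.pdf-archive.com/"
    for agent in ["GPTBot", "Bytespider", "SomeOtherBot"]:
        print(agent, parser.can_fetch(agent, page))
    print(parser.site_maps())
```

Agent names are matched case-insensitively, so "GPTBot" falls under the gptbot group. Any crawler not named in a specific group falls through to the "*" group and is allowed.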