pdf-archive.com
robots.txt

Robots Exclusion Standard data for pdf-archive.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	pdf-archive.com
Base Domain	pdf-archive.com
Scan Status	Ok
Last Scan	2024-10-04T15:51:42+00:00
Next Scan	2024-10-11T15:51:42+00:00

Last Scan

Scanned	2024-10-04T15:51:42+00:00
URL	https://pdf-archive.com/robots.txt
Redirect	https://www.pdf-archive.com/robots.txt
Redirect Domain	www.pdf-archive.com
Redirect Base	pdf-archive.com
Domain IPs	13.37.6.18
Redirect IPs	51.158.54.25
Response IP	51.158.54.25
Found	Yes
Hash	ce2b0a4823895a9318e6f1da62242da5e487e0124b3cb6008e0f0c7149cc896c
SimHash	eb041840c853

Groups

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

turnitinbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

sirdatabot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

/

yacybot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

*

Rule	Path
Disallow

Rule

Path

Disallow

Back to top

Other Records

Field	Value
sitemap	https://www.pdf-archive.com/sitemap.xml

Field

Value

sitemap

https://www.pdf-archive.com/sitemap.xml

Back to top

pdf-archive.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

gptbot

turnitinbot

sirdatabot

bytespider

yacybot

*

Other Records

pdf-archive.com
robots.txt