preservearticles.com
robots.txt

Robots Exclusion Standard data for preservearticles.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	preservearticles.com
Base Domain	preservearticles.com
Scan Status	Ok
Last Scan	2025-10-03T01:31:57+00:00
Next Scan	2025-10-10T01:31:57+00:00

Last Scan

Scanned	2025-10-03T01:31:57+00:00
URL	https://preservearticles.com/robots.txt
Domain IPs	104.21.50.145, 172.67.207.2, 2606:4700:3033::6815:3291, 2606:4700:3034::ac43:cf02
Response IP	104.21.50.145
Found	Yes
Hash	9f71b7e05f6ae638c924a6c5a25bb6e20eb54bedfd3c905d2bf9646bc7d8f397
SimHash	65394153c6b1

Groups

*

Rule	Path
Disallow	/cgi-bin/
Disallow	/wp-admin/
Disallow	/wp-includes/
Disallow	/wp-content/plugins/
Disallow	/feed/
Disallow	*/feed/
Disallow	/index.php
Disallow	/xmlrpc.php
Disallow	/search?
Disallow	/search/
Disallow	/page/
Disallow	/author/
Disallow	/home/
Disallow	/?
Disallow	/?attachment_id

Rule

Path

Disallow

/cgi-bin/

Disallow

/wp-admin/

Disallow

/wp-includes/

Disallow

/wp-content/plugins/

Disallow

/feed/

Disallow

*/feed/

Disallow

/index.php

Disallow

/xmlrpc.php

Disallow

/search?

Disallow

/search/

Disallow

/page/

Disallow

/author/

Disallow

/home/

Disallow

/?attachment_id

*

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	15

Field

Value

crawl-delay

ia_archiver

Rule	Path
Disallow

Rule

Path

Disallow

mediapartners-google*

Rule	Path
Allow	/

Rule

Path

Allow

adsbot-google

Rule	Path
Allow	/

Rule

Path

Allow

googlebot-mobile

Rule	Path
Allow	/

Rule

Path

Allow

preservearticles.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

*

Other Records

ia_archiver

mediapartners-google*

adsbot-google

googlebot-mobile

preservearticles.com
robots.txt