sourcearchive.com
robots.txt

Robots Exclusion Standard data for sourcearchive.com

Resource Scan

Scan Details

Site Domain sourcearchive.com
Base Domain sourcearchive.com
Scan Status Ok
Last Scan2025-09-23T00:56:14+00:00
Next Scan 2025-10-23T00:56:14+00:00

Last Scan

Scanned2025-09-23T00:56:14+00:00
URL https://sourcearchive.com/robots.txt
Domain IPs 104.21.53.204, 172.67.218.177, 2606:4700:3035::6815:35cc, 2606:4700:3036::ac43:dab1
Response IP 104.21.53.204
Found Yes
Hash 0c6646fc234a7bbc0b34d991e36d5116d1052193ca15dd65c06b92750fef35cc
SimHash 892c6a80a45a

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /component/*
Disallow /wp-login.php*
Disallow /cdn-cgi/
Disallow /?author=
Disallow /author/
Disallow /feed/$
Disallow /tag/
Disallow /search/
Disallow /?s=
Disallow /?__hstc=
Disallow /p%3D*
Disallow /comment-page
Disallow /*comment-page*
Disallow /*?yith
Disallow /?gclid*
Disallow */feed/
Disallow /?wc-ajax*

Other Records

Field Value
sitemap https://sourcearchive.com/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK