Robots Exclusion Standard data for cdn.archpaper.com

Resource Scan

Scan Details

Site Domain cdn.archpaper.com
Base Domain archpaper.com
Scan Status Ok
Last Scan 2024-04-17T17:09:38+00:00
Next Scan 2024-05-17T17:09:38+00:00

Last Scan

Scanned 2024-04-17T17:09:38+00:00
URL https://cdn.archpaper.com/robots.txt
Redirect https://www.archpaper.com/robots.txt
Redirect Domain www.archpaper.com
Redirect Base archpaper.com
Domain IPs 2600:9000:23d0:1400:1d:ec91:a140:93a1, 2600:9000:23d0:2800:1d:ec91:a140:93a1, 2600:9000:23d0:2c00:1d:ec91:a140:93a1, 2600:9000:23d0:5c00:1d:ec91:a140:93a1, 2600:9000:23d0:5e00:1d:ec91:a140:93a1, 2600:9000:23d0:7a00:1d:ec91:a140:93a1, 2600:9000:23d0:a000:1d:ec91:a140:93a1, 2600:9000:23d0:f600:1d:ec91:a140:93a1, 65.9.112.101, 65.9.112.115, 65.9.112.44, 65.9.112.91
Redirect IPs 18.161.97.100, 18.161.97.110, 18.161.97.25, 18.161.97.59, 2600:9000:23d0:4e00:1d:ec91:a140:93a1, 2600:9000:23d0:6000:1d:ec91:a140:93a1, 2600:9000:23d0:800:1d:ec91:a140:93a1, 2600:9000:23d0:b400:1d:ec91:a140:93a1, 2600:9000:23d0:c600:1d:ec91:a140:93a1, 2600:9000:23d0:c800:1d:ec91:a140:93a1, 2600:9000:23d0:d800:1d:ec91:a140:93a1, 2600:9000:23d0:f800:1d:ec91:a140:93a1
Response IP 18.165.171.80
Found Yes
Hash 40d887023994c151e87cfa1cda3b972f6cafb2c881660560491bb4e433b9ac5b
SimHash 6bb4ccc2c693

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /xmlrpc.php
Disallow /?s=
Disallow /search/
Disallow /calendar/
Disallow /?post_type=tribe_events*

Other Records

Field Value
crawl-delay 6
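
As a rough illustration of how a crawler might consume the `*` group above, the sketch below feeds those rules to Python's standard `urllib.robotparser`. The robots.txt text is reconstructed from the listed rules, so its exact layout is an assumption, and `ExampleBot` is a hypothetical user agent. Note that `urllib.robotparser` does plain prefix matching and does not expand `*` inside paths.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "*" group in this report; the original file's
# exact layout is not shown here, so this text is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /wp-admin/
Disallow: /xmlrpc.php
Disallow: /?s=
Disallow: /search/
Disallow: /calendar/
Disallow: /?post_type=tribe_events*
Crawl-delay: 6
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is a hypothetical agent; it falls through to the "*" group.
blocked = parser.can_fetch("ExampleBot", "https://www.archpaper.com/wp-admin/")
open_path = parser.can_fetch("ExampleBot", "https://www.archpaper.com/2024/04/")
delay = parser.crawl_delay("ExampleBot")

print(blocked, open_path, delay)  # False True 6
```

A polite client would wait the reported crawl delay (6 seconds here) between requests to the same host.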

googlebot-image

Rule Path
Allow /
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /xmlrpc.php
Disallow /?s=
Disallow /search/
Disallow /calendar/
Disallow /?post_type=tribe_events*

googlebot-mobile

Rule Path
Allow /
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /xmlrpc.php
Disallow /?s=
Disallow /search/
Disallow /?post_type=tribe_events*

Other Records

Field Value
crawl-delay 6

adsbot-google

Rule Path
Allow /
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /xmlrpc.php
Disallow /?s=
Disallow /search/
Disallow /calendar/
Disallow /?post_type=tribe_events*

googlebot

Rule Path
Allow /
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /xmlrpc.php
Disallow /?s=*
Disallow /search/*
Disallow /calendar/*
Disallow /archives/*
Disallow /?post_type=tribe_events*
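
The googlebot group pairs a broad `Allow /` with narrower `Disallow` patterns that use `*`. Under the longest-match rule of RFC 9309, the most specific matching pattern decides the outcome, so the narrower `Disallow` entries still apply. The following is a minimal sketch of that evaluation, not any crawler's actual implementation; the helper names `pattern_to_regex` and `is_allowed` are hypothetical, and percent-encoding normalization is left out for brevity.

```python
import re

# Rules copied from the googlebot group above, as (directive, path) pairs.
GOOGLEBOT_RULES = [
    ("allow", "/"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/?s=*"),
    ("disallow", "/search/*"),
    ("disallow", "/calendar/*"),
    ("disallow", "/archives/*"),
    ("disallow", "/?post_type=tribe_events*"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """'*' matches any run of characters; a trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules) -> bool:
    """Longest matching pattern wins; on a tie, allow wins; no match allows."""
    best_len = -1
    allowed = True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and directive == "allow"):
                best_len = length
                allowed = directive == "allow"
    return allowed


print(is_allowed("/search/towers", GOOGLEBOT_RULES))    # False
print(is_allowed("/2024/04/article/", GOOGLEBOT_RULES))  # True
```

Pattern length is used here as the measure of specificity, which is a simplification that works for the rules in this group.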

mediapartners-google

Rule Path
Allow /

ahrefsbot

Rule Path
Allow /
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /xmlrpc.php
Disallow /?s=
Disallow /search/
Disallow /calendar/

Other Records

Field Value
crawl-delay 6

yandex

Rule Path
Disallow *

Other Records

Field Value
sitemap https://www.archpaper.com/sitemap_index.xml