wpaa.net
robots.txt

Robots Exclusion Standard data for wpaa.net

Resource Scan

Scan Details

Site Domain wpaa.net
Base Domain wpaa.net
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-07-24T02:47:50+00:00
Next Scan 2024-10-22T02:47:50+00:00

Last Successful Scan

Scanned2023-09-05T23:38:35+00:00
URL https://wpaa.net/robots.txt
Redirect https://www.wpaa.net/robots.txt
Redirect Domain www.wpaa.net
Redirect Base wpaa.net
Domain IPs 108.157.177.101, 108.157.177.114, 108.157.177.120, 108.157.177.57
Redirect IPs 65.9.112.51, 65.9.112.53, 65.9.112.84, 65.9.112.85
Response IP 52.222.144.7
Found Yes
Hash 9fe4d88926d79d4fe4c14d87801b2dc32c53282ff51bc40d9eadabab65cb9e16
SimHash f51cfc8ad2d0

Groups

*

Rule Path
Disallow /leads/
Disallow *.pdf$
Disallow /inventory.aspx*
Disallow /inventory-*.html

bingbot
msnbot
semrushbot
semrushbot-sa
scoutjet
siteimprove.com
match by siteimprove.com
linkcheck by siteimprove.com
sitecheck-sitecrawl by siteimprove.com

Rule Path
Disallow /leads/
Disallow *.pdf$
Disallow /inventory.aspx*
Disallow /inventory-*.html

Other Records

Field Value
crawl-delay 6

mj12bot
omniexplorer_bot
wells search ii 0.0
heritrix/1.10.0
shopwiki
scanalert
copernic
psbot
python-urllib
baiduspider
yandex
ahrefsbot
trovitbot
blexbot
seokicks-robot
cliqzbot
mauibot
bubing
qwantify
tweetmemebot
autobot
seokicks
petalbot
barkrowler
zoominfobot
sogou spider
tinytestbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.wpaa.net/sitemap.xml
sitemap https://www.wpaa.net/sitemap-video.xml
sitemap https://www.wpaa.net/sitemap-geo.xml
sitemap https://www.wpaa.net/sitemap-images.xml

Warnings

  • 2 invalid lines.