petsuppliesplus.com
robots.txt
Robots Exclusion Standard data for petsuppliesplus.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | petsuppliesplus.com |
Base Domain | petsuppliesplus.com |
Scan Status | Ok |
Last Scan | 2024-06-03T22:08:31+00:00 |
Next Scan | 2024-07-03T22:08:31+00:00 |
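The two timestamps above are exactly 30 days apart, which suggests a fixed rescan interval. The report does not state this, so the interval in the sketch below is an inference, and `SCAN_INTERVAL` and `next_scan` are hypothetical names used only for illustration.

```python
from datetime import datetime, timedelta

# Assumption: the scanner reschedules on a fixed 30-day interval.
# This is inferred from the Last Scan / Next Scan values, not documented.
SCAN_INTERVAL = timedelta(days=30)


def next_scan(last_scan_iso: str) -> str:
    """Return the ISO 8601 timestamp one interval after the last scan."""
    last = datetime.fromisoformat(last_scan_iso)
    return (last + SCAN_INTERVAL).isoformat()


# The Last Scan value from the table reproduces the Next Scan value.
print(next_scan("2024-06-03T22:08:31+00:00"))  # 2024-07-03T22:08:31+00:00
```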
Last Scan
Field | Value |
---|---|
Scanned | 2024-06-03T22:08:31+00:00 |
URL | https://petsuppliesplus.com/robots.txt |
Redirect | https://www.petsuppliesplus.com/robots.txt |
Redirect Domain | www.petsuppliesplus.com |
Redirect Base | petsuppliesplus.com |
Domain IPs | 20.114.176.185 |
Redirect IPs | 20.114.176.185 |
Response IP | 20.114.176.185 |
Found | Yes |
Hash | f2cf7363156c141943aa5c1982d74f25ab768cae737b88e85adbd49f8ef1f267 |
SimHash | 55dece31af98 |
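The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. Under that assumption, a content fingerprint for change detection between scans could be sketched as follows; `fingerprint` and `has_changed` are hypothetical helper names.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Report whether a newly fetched body differs from the stored hash."""
    return fingerprint(body) != previous_hash
```

A scanner could store the digest after each fetch and compare it on the next scan to decide whether the file needs to be parsed again.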
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /api/ |
Disallow | /search |
Disallow | /sitecore/ |
aihitbot
barkrowler
bdcbot
blexbot
blp_bbot
brightbot
brokenlinkcheck.com
buck
ccbot
cliqzbot
curl/7.54.0
cyencebot
domaincrawler
dow jones searchbot
exabot
extlinksbot
femtosearchbot
fever
garlikcrawler
gigabot
gobuster
grapeshotcrawler
heritrix
istellabot
jersey
jobkicks
libwww-perl
linkdexbot
linkpadbot
ltx71 - (http://ltx71.com/)
lua-resty-http
lumtelbot
magpie-crawler
magus bot
mail.ru_bot
megaindex.ru
mozilla/5.0 (compatible; msie 10.0; windows nt 6.1; trident/6.0) linkcheck by siteimprove.com
mozilla/5.0 (compatible; msie 10.0; windows nt 6.1; trident/6.0) sitecheck-sitecrawl by siteimprove.com
nl-crawler
onpagebot
riddler
scoutjet
scrapy
seekport
seznambot
siteimprove
smtbot
uptimerobot
velenpublicwebcrawler
wget
yacybot
yeti
yisouspider
yunsecuritybot
zoominfobot
Rule | Path |
---|---|
Disallow | / |
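To illustrate how a crawler might evaluate the two groups above, the following Python sketch applies the longest-match precedence described in RFC 9309, with Allow winning a tie. The rule data is transcribed from this report; the agent set is shortened to a few of the listed names, the substring test on the user-agent is a simplification, and `rules_for` and `is_allowed` are hypothetical helper names. The standard library's `urllib.robotparser` is deliberately not used here because it applies rules in file order rather than by longest match.

```python
# Rules for the "*" group, as listed in the report.
DEFAULT_RULES = [
    ("allow", "/"),
    ("disallow", "/api/"),
    ("disallow", "/search"),
    ("disallow", "/sitecore/"),
]

# A few of the user agents from the second group (abbreviated for the sketch).
BLOCKED_AGENTS = {"ccbot", "scrapy", "wget", "uptimerobot"}
BLOCKED_RULES = [("disallow", "/")]


def rules_for(agent: str):
    """Pick the rule group for a user-agent string (simplified substring match)."""
    token = agent.lower()
    if any(name in token for name in BLOCKED_AGENTS):
        return BLOCKED_RULES
    return DEFAULT_RULES


def is_allowed(agent: str, path: str) -> bool:
    """Apply longest-match precedence; Allow wins when lengths are equal."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, prefix in rules_for(agent):
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed
```

Under these assumptions, `is_allowed("Googlebot", "/api/stores")` returns `False` because `/api/` is a longer match than `/`, while `is_allowed("Wget/1.21", "/")` returns `False` because the second group disallows the whole site.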
Other Records
Field | Value |
---|---|
sitemap | https://www.petsuppliesplus.com/sitemap.xml |