pxhere.com
robots.txt

Robots Exclusion Standard data for pxhere.com

Resource Scan

Scan Details

Site Domain pxhere.com
Base Domain pxhere.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-04-11T21:58:15+00:00
Next Scan 2024-07-10T21:58:15+00:00
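
The gap between the Last Scan and Next Scan timestamps can be checked with a short calculation. This is only a sketch: the variable names are illustrative, and the resulting interval is inferred from the two timestamps above, not from any stated rescan policy.

```python
from datetime import datetime

# Timestamps copied from the Scan Details fields above (ISO 8601, UTC offset).
last_scan = datetime.fromisoformat("2024-04-11T21:58:15+00:00")
next_scan = datetime.fromisoformat("2024-07-10T21:58:15+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 90
```

The two timestamps share the same time of day, so the difference is a whole number of days.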

Last Successful Scan

Scanned 2023-09-12T13:23:00+00:00
URL https://pxhere.com/robots.txt
Domain IPs 104.26.12.7, 104.26.13.7, 172.67.70.214, 2606:4700:20::681a:c07, 2606:4700:20::681a:d07, 2606:4700:20::ac43:46d6
Response IP 172.67.70.214
Found Yes
Hash 10965a1b3099534011d692b5af583eaad3290781f82b083f0191e699271df547
SimHash 6f1cda628b1b

Groups

*

Rule Path
Disallow /*/login
Disallow /*/signup
Allow /
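
The rules in this group combine a wildcard pattern with a catch-all Allow. A minimal sketch of how such rules could be evaluated follows, assuming the longest-match convention described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The function names and the rule representation are hypothetical, and individual crawlers may behave differently.

```python
import re

def pattern_to_regex(path_pattern):
    # '*' matches any run of characters; a trailing '$' anchors the end of the path.
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    body = ".*".join(re.escape(part) for part in path_pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(rules, path):
    # Longest matching pattern wins; on equal length, Allow takes precedence.
    best_len, best_allow = -1, True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow

# Rules of the '*' group as listed above.
DEFAULT_GROUP = [
    ("disallow", "/*/login"),
    ("disallow", "/*/signup"),
    ("allow", "/"),
]

print(is_allowed(DEFAULT_GROUP, "/en/login"))      # False
print(is_allowed(DEFAULT_GROUP, "/en/photo/123"))  # True
print(is_allowed(DEFAULT_GROUP, "/login"))         # True
```

Under this reading, "/login" stays allowed because "/*/login" requires at least one path segment boundary before "login"; only paths such as "/en/login" are blocked.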

yandexbot
yandex
ahrefsbot
duckduckbot
bingbot

Rule Path
Disallow /*/login
Disallow /*/signup
Allow /

Other Records

Field Value
crawl-delay 1

mj12bot
sitebot
dotbot
ocelli
sistrix
shopwiki
wbsearchbot
riddlerbot
linguatools
www.integromedb.org/crawler
ccbot

Rule Path
Disallow /
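
The three groups above can be summarised as a lookup from user-agent token to rule set. The sketch below is an illustration only: the data structure and the select_group helper are hypothetical, matching is by exact lowercase token (a simplification of how real crawlers match product tokens), and any agent not named falls back to the '*' group.

```python
# Rule sets copied from the three groups listed above.
OPEN_RULES = [("disallow", "/*/login"), ("disallow", "/*/signup"), ("allow", "/")]
BLOCKED_RULES = [("disallow", "/")]

GROUPS = [
    (("yandexbot", "yandex", "ahrefsbot", "duckduckbot", "bingbot"), OPEN_RULES),
    (("mj12bot", "sitebot", "dotbot", "ocelli", "sistrix", "shopwiki",
      "wbsearchbot", "riddlerbot", "linguatools",
      "www.integromedb.org/crawler", "ccbot"), BLOCKED_RULES),
]

def select_group(user_agent_token):
    # Return the rule set of the first group naming this agent, else the '*' rules.
    token = user_agent_token.lower()
    for agents, rules in GROUPS:
        if token in agents:
            return rules
    return OPEN_RULES  # the '*' group carries the same rules as the named open group

print(select_group("CCBot") is BLOCKED_RULES)      # True
print(select_group("bingbot") is OPEN_RULES)       # True
print(select_group("Googlebot") is OPEN_RULES)     # True
```

In this summary the named open group and the '*' group share the same path rules; the crawl-delay record attached to the named group is not modelled here.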

Other Records

Field Value
sitemap https://pxhere.com/sitemap/sitemap.xml
sitemap https://pxhere.com/sitemap.xml

Warnings

  • 1 invalid line.