
Robots Exclusion Standard data for polahoki.net

Resource Scan

Scan Details

Site Domain polahoki.net
Base Domain polahoki.net
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-10-31T01:26:31+00:00
Next Scan 2025-01-29T01:26:31+00:00
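
The gap between the two timestamps above can be checked with a few lines of Python. This is only a sketch of the arithmetic: the variable names are illustrative, and treating the result as the scanner's fixed rescan interval is an assumption drawn from this one pair of dates, not something the report states.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details table above.
last_scan = datetime.fromisoformat("2024-10-31T01:26:31+00:00")
next_scan = datetime.fromisoformat("2025-01-29T01:26:31+00:00")

# Difference between the scheduled next scan and the last attempt.
interval = next_scan - last_scan
print(interval.days)
```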

Last Successful Scan

Scanned 2023-12-14T00:52:43+00:00
URL https://polahoki.net/robots.txt
Domain IPs 104.21.78.6, 172.67.214.68, 2606:4700:3032::6815:4e06, 2606:4700:3032::ac43:d644
Response IP 172.67.214.68
Found Yes
Hash e22d14083f98319b176568b5febac2333fb4513a309a6bc103b935ce267f16c0
SimHash 2919d337d7d1

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /tmp/
Allow /*.jpg$
Allow /*.jpeg$
Allow /*.gif$
Allow /*.png$
Allow /*.webp$
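
The rules in this group mix plain prefixes with `*` and `$` patterns, so it may help to see how a crawler could evaluate them. The sketch below assumes the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie); the helper names `pattern_to_regex` and `is_allowed` are hypothetical, and individual crawlers may behave differently.

```python
import re

# Rules copied from the "*" group above, as (directive, path pattern) pairs.
RULES = [
    ("disallow", "/cgi-bin/"),
    ("disallow", "/tmp/"),
    ("allow", "/*.jpg$"),
    ("allow", "/*.jpeg$"),
    ("allow", "/*.gif$"),
    ("allow", "/*.png$"),
    ("allow", "/*.webp$"),
]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under `rules`.

    Assumed precedence: the longest matching pattern wins, Allow wins a tie,
    and a path that matches no rule is allowed.
    """
    best = None
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            key = (len(pattern), directive == "allow")
            if best is None or key > best[0]:
                best = (key, directive)
    return best is None or best[1] == "allow"

for path in ["/index.html", "/cgi-bin/script.pl", "/tmp/photo.jpg", "/cgi-bin/logo.png"]:
    print(path, is_allowed(path))
```

Under these assumptions, `/tmp/photo.jpg` is allowed because `/*.jpg$` (7 characters) is longer than `/tmp/` (5), while `/cgi-bin/logo.png` stays blocked because `/cgi-bin/` (9 characters) is longer than `/*.png$` (7).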

dmca.com page protection crawling service

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-video

Rule Path
Allow /

bingbot

Rule Path
Allow /

slurp

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

msnbot

Rule Path
Allow /

yahoo pipes 1.0

Rule Path
Allow /

yahoo! slurp

Rule Path
Allow /

yandexbot

Rule Path
Allow /

yandeximages

Rule Path
Allow /

yandexnews

Rule Path
Allow /

yandexwebmaster

Rule Path
Allow /

yandexpagechecker

Rule Path
Allow /

zyborg

Rule Path
Allow /

exabot

Rule Path
Allow /

facebot

Rule Path
Allow /

ia_archiver

Rule Path
Allow /

archive.org_bot

Rule Path
Allow /

architextspider

Rule Path
Allow /

feedfetcher-google

Rule Path
Allow /
Allow /

linkedinbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://marthanew.com/sitemap.xml

Warnings

  • 2 invalid lines.
  • `user-agen` is not a known field.
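
A warning like the one above can be reproduced with a simple field-name check. The following is a minimal sketch, not the scanner's actual logic: the `KNOWN_FIELDS` set, the `find_invalid_lines` helper, and the `SAMPLE` text are all hypothetical, since the original robots.txt lines that triggered the warnings are not shown in this report.

```python
# Field names accepted by this sketch (an assumed list, not the scanner's).
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay", "host"}

def find_invalid_lines(text):
    """Return (line number, message) pairs for lines that are not valid records."""
    problems = []
    for number, raw in enumerate(text.splitlines(), start=1):
        line = raw.split("#", 1)[0].strip()  # drop comments and surrounding whitespace
        if not line:
            continue  # blank and comment-only lines are fine
        field, sep, _value = line.partition(":")
        name = field.strip().lower()
        if not sep:
            problems.append((number, "missing ':' separator"))
        elif name not in KNOWN_FIELDS:
            problems.append((number, f"`{name}` is not a known field"))
    return problems

# Hypothetical input with a misspelled "User-agent" field on line 3.
SAMPLE = "User-agent: *\nDisallow: /tmp/\nUser-agen: examplebot\nAllow: /\n"
print(find_invalid_lines(SAMPLE))
```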