wvcheckbook.gov
robots.txt

Robots Exclusion Standard data for wvcheckbook.gov

Resource Scan

Scan Details

Site Domain wvcheckbook.gov
Base Domain wvcheckbook.gov
Scan Status Ok
Last Scan2024-09-23T23:12:00+00:00
Next Scan 2024-10-23T23:12:00+00:00

Last Scan

Scanned2024-09-23T23:12:00+00:00
URL https://www.wvcheckbook.gov/robots.txt
Domain IPs 104.19.218.112, 104.19.219.112, 2606:4700::6813:da70, 2606:4700::6813:db70
Response IP 104.19.218.112
Found Yes
Hash d92fc5a03956f2cdcd141ea1e8e070c328d8d22baceff48f3ae39c3e70dc14e9
SimHash c21251c26e73

Groups

*

Rule Path
Disallow /dataset?*
Disallow /dataset/?*
Disallow /dataset/activity/*
Disallow /dataset/groups/*
Disallow /dataset/showcases/*
Disallow /dataset/*/issues/*
Disallow /dataset/*/resource/*
Disallow /datastore/*
Disallow /datarequest/*
Disallow /group/*?*
Disallow /organization/*?*
Disallow /showcase?*
Disallow /issues/
Disallow /revision/
Disallow /user/*
Disallow /api/
Disallow /cgi-bin
Disallow /wp-admin/
Disallow /wp-content/
Disallow /wp-includes/
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz$
Disallow /*.wmv$
Disallow /*.cgi$
Disallow /*.xhtml$
Disallow /ar/
Disallow /bg/
Disallow /ca
Disallow /cs_CZ/
Disallow /da_DK/
Disallow /de/
Disallow /dv/
Disallow /el/
Disallow /en/
Disallow /en_AU/
Disallow /en_GB/
Disallow /es/
Disallow /es_AR/
Disallow /fa_IR/
Disallow /fi/
Disallow /fr/
Disallow /he/
Disallow /hr/
Disallow /hu/
Disallow /id/
Disallow /is/
Disallow /it/
Disallow /ja/
Disallow /km/
Disallow /ko_KR/
Disallow /lt/
Disallow /lv/
Disallow /mn_MN/
Disallow /my_MM/
Disallow /ne/
Disallow /nl/
Disallow /no/
Disallow /pl/
Disallow /pt_BR/
Disallow /pt_PT/
Disallow /ro/
Disallow /ru/
Disallow /sk/
Disallow /sl/
Disallow /sq/
Disallow /sr/
Disallow /sr_Latn/
Disallow /sv/
Disallow /th/
Disallow /tl/
Disallow /tr/
Disallow /uk_UA/
Disallow /vi/
Disallow /zh_CN/
Disallow /zh_HK/
Disallow /zh_TW/

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 15

jigsaw

Rule Path
Allow /

linkcheck

Rule Path
Allow /

sitecheck-sitecrawl

Rule Path
Allow /

siteimprovebot

Rule Path
Allow /

siteimprovebot-crawler

Rule Path
Allow /

siteimprove_w3c_validator

Rule Path
Allow /

ahrefsbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

buck

Rule Path
Disallow /

censysinspect

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

httrack

Rule Path
Disallow /

jorgee

Rule Path
Disallow /

larbin

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

mozlila

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

npbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

spbot

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

xenu

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webzip

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

Warnings

  • 2 invalid lines.