kspnews.com
robots.txt

Robots Exclusion Standard data for kspnews.com

Resource Scan

Scan Details

Site Domain kspnews.com
Base Domain kspnews.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-11-12T15:14:11+00:00
Next Scan 2024-11-26T15:14:11+00:00

Last Successful Scan

Scanned2024-10-27T19:48:08+00:00
URL https://kspnews.com/robots.txt
Domain IPs 121.78.144.194
Response IP 121.78.144.194
Found Yes
Hash 91fc45da0677a3ab47ab68708fb7ea271f989a1d4cd624e1b43f6b3027d74d36
SimHash 6777c940c394

Groups

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

adsbot-google

Rule Path
Allow /

adsbot-google-mobile

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

googlebot

Rule Path
Allow

googlebot-news

Rule Path
Allow

blexbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

*

Rule Path
Disallow /bbs

*

Rule Path
Disallow /newnews

*

Rule Path
Disallow /admin

*

Rule Path
Disallow /news_skin

*

Rule Path
Disallow /m_g.php

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /