mediakampung.com
robots.txt

Robots Exclusion Standard data for mediakampung.com

Resource Scan

Scan Details

Site Domain mediakampung.com
Base Domain mediakampung.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-10-30T07:53:58+00:00
Next Scan 2024-11-13T07:53:58+00:00

Last Successful Scan

Scanned2024-10-15T07:48:09+00:00
URL https://mediakampung.com/robots.txt
Domain IPs 103.151.32.13
Response IP 103.151.32.13
Found Yes
Hash e128edb16d311bfd59d916c9a0bc528bf5234fec9fbef620414d8ebfcb653417
SimHash 7911d160df73

Groups

*

Rule Path
Disallow */trackback/
Disallow */xmlrpc.php
Disallow /wp-*.php
Disallow /cgi-bin/
Disallow *?jxrecoid=*
Disallow *?utm_source=*
Disallow *?source=*
Disallow *?_ga
Disallow *%26amp%3Bsortby
Disallow *%26amp%3Bdevice%3Ddesktop
Disallow *edu/pov/d-*
Allow */storage/
Allow *?amp=1
Allow *amp

gptbot

Rule Path
Disallow /

openai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://mediakampung.com/sitemap.xml
sitemap https://mediakampung.com/sitemap-news.xml
sitemap https://mediakampung.com/sitemap-posts.xml
sitemap https://mediakampung.com/sitemap-categories.xml
sitemap https://mediakampung.com/sitemap-tags.xml
sitemap https://mediakampung.com/sitemap-attachment.xml

Comments

  • Robots