pupuweb.com
robots.txt

Robots Exclusion Standard data for pupuweb.com

Resource Scan

Scan Details

Site Domain pupuweb.com
Base Domain pupuweb.com
Scan Status Ok
Last Scan2024-09-15T16:44:57+00:00
Next Scan 2024-09-22T16:44:57+00:00

Last Scan

Scanned2024-09-15T16:44:57+00:00
URL https://pupuweb.com/robots.txt
Domain IPs 104.21.24.95, 172.67.218.38, 2606:4700:3032::6815:185f, 2606:4700:3034::ac43:da26
Response IP 172.67.218.38
Found Yes
Hash ff8b602769522fac8019e29646d0bde0f53d41aa5a3d5fc33d8467a76c10ff7e
SimHash b459484ac6b4

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /archives/
Disallow /comments/feed/
Disallow /trackback/
Disallow /index.php
Disallow /xmlrpc.php
Disallow *?wptheme
Disallow *?comments*
Disallow *?wordfence_lh*
Disallow *?amp*
Disallow *?noamp*
Disallow /search?
Disallow /?p*
Disallow */feed/*
Disallow */embed/*
Disallow */page/*
Disallow */category/*
Disallow */tag/*

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

doc

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

fetch

Rule Path
Disallow /

hmse_robot

Rule Path
Disallow /

httrack

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

linko

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

npbot

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webzip

Rule Path
Disallow /

wget

Rule Path
Disallow /

xenu

Rule Path
Disallow /

zao

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

news-please

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

perplexity-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

adsbot-google

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

googlebot

Rule Path
Allow /

Other Records

Field Value
sitemap https://pupuweb.com/sitemap.xml
sitemap https://pupuweb.com/sitemap-news.xml

Comments

  • Disallow: /wp-content/