infomercado.pe
robots.txt

Robots Exclusion Standard data for infomercado.pe

Resource Scan

Scan Details

Site Domain infomercado.pe
Base Domain infomercado.pe
Scan Status Ok
Last Scan2024-09-19T02:15:29+00:00
Next Scan 2024-09-26T02:15:29+00:00

Last Scan

Scanned2024-09-19T02:15:29+00:00
URL https://infomercado.pe/robots.txt
Domain IPs 2406:da18:9d0:143f:2124:4e9c:36a9:d9de, 52.221.42.138
Response IP 52.221.42.138
Found Yes
Hash 3fb3869af24e340eb317dc2dae373edd7287a6987ab17e6a70788714637fb634
SimHash e0f45a82c1d2

Groups

*

Rule Path Comment
Allow /wp-admin/admin-ajax.php -
Disallow /wp-admin/ block access to admin section
Disallow /wp-login.php block access to admin section
Disallow /search/ block access to internal search result pages
Disallow *?s=* block access to internal search result pages
Disallow *?p=* block access to pages for which permalinks fails
Disallow *%26p%3D* block access to pages for which permalinks fails
Disallow *%26preview%3D* block access to preview pages
Disallow /404-error/ block access to 404 page
Disallow /*?utm_source= block access to utm parameters
Disallow /*?utm_medium= block access to utm parameters
Disallow /*?utm_campaign= block access to utm parameters
Disallow /*?amp$ block access to amp old content
Disallow /*%26amp$ block access to amp old content
Disallow /*%26amp%3B$ block access to amp old content
Disallow /*?amp= block access to amp old content
Disallow /*%26amp%3D block access to amp old content
Disallow /detroitchicago/ -
Disallow /beardeddragon/ -
Disallow /porpoiseant/ -
Disallow /tardisrocinante/ -
Disallow /ezossp/ -
Disallow /ezais/ -
Disallow /ezoic/ -
Allow /feed/$ -
Disallow /feed -
Disallow /comments/feed -
Disallow /*/feed/$ -
Disallow /*/feed/rss/$ -
Disallow /*/trackback/$ -
Disallow /*/*/feed/$ -
Disallow /*/*/feed/rss/$ -
Disallow /*/*/trackback/$ -
Disallow /*/*/*/feed/$ -
Disallow /*/*/*/feed/rss/$ -
Disallow /*/*/*/trackback/$ -
Allow /*.js$ -
Allow /*.css$ -
Disallow /*.pdf$ -

googlebot-image

Rule Path
Allow /wp-content/uploads/

adsbot-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

gurujibot

Rule Path
Disallow /

hl_ftien_spider

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

yeti

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /

Comments

  • Disallow: /author/ #block access to author pages
  • Ezoic stuff
  • Impedir el acceso a los diferentes feed que genere la página
  • Impedir URLs terminadas en /trackback/ que sirven como Trackback URL.
  • Evita bloqueos de CSS y JS.
  • Bloquear todos los pdfs
  • Bloquear parámetros
  • Lista de bots que deberías permitir.
  • Lista de bots bloqueados

Warnings

  • 1 invalid line.