apanews.net
robots.txt

Robots Exclusion Standard data for apanews.net

Resource Scan

Scan Details

Site Domain apanews.net
Base Domain apanews.net
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-11-18T15:02:19+00:00
Next Scan 2026-02-16T15:02:19+00:00

Last Successful Scan

Scanned2025-07-22T02:14:41+00:00
URL https://apanews.net/robots.txt
Domain IPs 104.26.2.161, 104.26.3.161, 172.67.75.206, 2606:4700:20::681a:2a1, 2606:4700:20::681a:3a1, 2606:4700:20::ac43:4bce
Response IP 104.26.2.161
Found Yes
Hash bc3e4d54011e6b9480bb5d9e81d17d1e80a8bd460f69deaf9216ac473013fb7d
SimHash 5b2050d4869b

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /wp-register.php
Disallow /?s=*
Disallow /search/
Allow /wp-includes/js/
Allow /wp-content/themes/
Allow /wp-content/plugins/
Allow /wp-content/uploads/

archive.org_bot

Rule Path
Disallow

ia_archiver

Rule Path
Disallow

googlebot

Rule Path
Disallow

bingbot

Rule Path
Disallow

slurp

Rule Path
Disallow

facebookexternalhit

Rule Path
Disallow

twitterbot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-news

Rule Path
Disallow

Other Records

Field Value
sitemap https://apanews.net/sitemap.xml
sitemap https://fr.apanews.net/sitemap.xml

Comments

  • Bloquer les pages sensibles de WordPress
  • Archive.org (Wayback Machine)
  • Autoriser Googlebot
  • Autoriser Bingbot
  • bloquer search Bingbot
  • User-agent: *
  • Disallow: /search/
  • Disallow: /?s=
  • Autoriser Slurp
  • Autoriser facebookexternalhit
  • Autoriser Twitterbot
  • Autoriser Googlebot-Image
  • Autoriser Googlebot-News