editorsweblog.org
robots.txt

Robots Exclusion Standard data for editorsweblog.org

Resource Scan

Scan Details

Site Domain	editorsweblog.org
Base Domain	editorsweblog.org
Scan Status	Ok
Last Scan	2024-09-13T17:40:59+00:00
Next Scan	2024-10-13T17:40:59+00:00

Last Scan

Scanned	2024-09-13T17:40:59+00:00
URL	http://www.editorsweblog.org/robots.txt
Redirect	https://wan-ifra.org/robots.txt
Redirect Domain	wan-ifra.org
Redirect Base	wan-ifra.org
Domain IPs	212.1.56.246
Redirect IPs	192.124.249.12
Response IP	192.124.249.12
Found	Yes
Hash	23c9d75858ee111939207c6a70b0f64e1066318007cc2fe643af18c0348e0e79
SimHash	c8e6c9517ea1

Groups

*

Rule	Path
Disallow	/wp-admin/
Disallow	/wp-includes/
Disallow	/wp-content/plugins/
Disallow	/wp-content/themes/
Disallow	/cgi-bin/
Disallow	/trackback/
Disallow	/xmlrpc.php
Disallow	/wp-login.php
Disallow	/wp-signup.php
Disallow	/?s=*
Disallow	/search/
Disallow	/search/*
Disallow	/wp-json/
Disallow	/*/trackback/
Disallow	/*/comments/
Disallow	/*?add-to-cart=*
Disallow	/*?orderby=*
Disallow	/*?filter_*
Disallow	/cdn-cgi/bm/cv/
Disallow	/cdn-cgi/challenge-platform/
Allow	/wp-content/uploads/
Allow	/wp-content/cache/
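
To illustrate how a crawler might evaluate the `*` group above, here is a minimal Python sketch. It assumes the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie) and uses only a subset of the listed rules. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and do not come from the scan.

```python
import re

# Subset of the "*" group rules listed above: (directive, path pattern).
RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-content/plugins/"),
    ("disallow", "/?s=*"),
    ("disallow", "/search/"),
    ("disallow", "/*/trackback/"),
    ("allow", "/wp-content/uploads/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under `rules`.

    Assumption: the longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len, allowed = -1, True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed
```

For example, `is_allowed("/wp-admin/options.php")` is False under these rules, while `is_allowed("/wp-content/uploads/2024/logo.png")` is True. Real crawlers may differ in details such as percent-encoding and case handling.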

ahrefsbot
semrushbot
mj12bot
dotbot
nuclei
wikido
riddler
petalbot
zoominfobot
go-http-client
node/simplecrawler
cazoodlebot
dotbot/1.0
gigabot
barkrowler
blexbot
magpie-crawler

Rule	Path
Disallow	/

googlebot

Rule	Path
Disallow	/*/feed/

bingbot

Rule	Path
Disallow	/*/feed/

slurp

Rule	Path
Disallow	/*/feed/

*

Rule	Path
Allow	/*/feed/
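
The scan lists several groups, including two separate `*` groups. As a rough sketch of how a crawler could pick the rules that apply to it, the Python below assumes the RFC 9309 behaviour: a crawler uses the group naming its own user-agent token if one exists, otherwise the `*` rules, with repeated groups for the same agent merged. `GROUPS` and `rules_for` are hypothetical names, the rule lists are abbreviated, and exact lowercase token equality is a simplifying assumption.

```python
# Groups in the order shown in this scan: (user-agent tokens, rules).
# Only two of the banned agents and two of the global rules are
# included to keep the sketch short.
GROUPS = [
    (["*"], [("disallow", "/wp-admin/"), ("allow", "/wp-content/uploads/")]),
    (["ahrefsbot", "semrushbot"], [("disallow", "/")]),
    (["googlebot"], [("disallow", "/*/feed/")]),
    (["bingbot"], [("disallow", "/*/feed/")]),
    (["slurp"], [("disallow", "/*/feed/")]),
    (["*"], [("allow", "/*/feed/")]),
]


def rules_for(user_agent, groups=GROUPS):
    """Return the rules that apply to `user_agent`.

    Rules from every group naming the agent are merged; if no group
    names it, the merged rules of all "*" groups are used instead.
    """
    token = user_agent.lower()
    specific, fallback = [], []
    for agents, rules in groups:
        if token in agents:
            specific.extend(rules)
        if "*" in agents:
            fallback.extend(rules)
    return specific or fallback
```

Under these assumptions, `rules_for("Googlebot")` yields only the `/*/feed/` disallow, so the global `*` rules would not bind a crawler that has its own group. An unlisted agent receives the merged rules from both `*` groups, including the feed allowance.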

Other Records

Field	Value
sitemap	https://wan-ifra.org/sitemap_index.xml

Comments

Global rules
-----------------
Prevent crawling CF challenge URLs
Allow access to necessary assets
Sitemap
-----------------
Ban bots that don't benefit us.
--------------------------------
Block feeds for search engines to reduce server load
-------------------------------
Allow feeds for other user agents
-----------------------------------
