scfp.ca
robots.txt

Robots Exclusion Standard data for scfp.ca

Archived Snapshots

Resource Scan

Scan Details

Site Domain	scfp.ca
Base Domain	scfp.ca
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2025-09-10T07:58:20+00:00
Next Scan	2025-12-09T07:58:20+00:00

Last Successful Scan

Scanned	2025-04-21T07:42:53+00:00
URL	https://scfp.ca/robots.txt
Redirect	https://scfp.ca/sites/cupe/robots-scfp.txt
Domain IPs	151.101.130.133, 151.101.194.133, 151.101.2.133, 151.101.66.133
Response IP	151.101.194.133
Found	Yes
Hash	3bd2bcd12d3dc2a7bf81ae37ae6cae812845fa4284446a61f3fc692942d4e33a
SimHash	3b94bd1ac570

Groups

*

Rule	Path
Disallow	/collective-agreement/
Disallow	/site-section/
Disallow	/includes/
Disallow	/misc/
Disallow	/modules/
Disallow	/profiles/
Disallow	/scripts/
Disallow	/themes/
Disallow	/CHANGELOG.txt
Disallow	/cron.php
Disallow	/INSTALL.mysql.txt
Disallow	/INSTALL.pgsql.txt
Disallow	/INSTALL.sqlite.txt
Disallow	/install.php
Disallow	/INSTALL.txt
Disallow	/LICENSE.txt
Disallow	/MAINTAINERS.txt
Disallow	/update.php
Disallow	/UPGRADE.txt
Disallow	/xmlrpc.php
Disallow	/sites/cupe/libraries/
Disallow	/sites/cupe/themes/
Disallow	/admin/
Disallow	/comment/reply/
Disallow	/filter/tips/
Disallow	/node/add/
Disallow	/forward
Disallow	/search
Disallow	/search/
Disallow	/recherche
Disallow	/recherche/
Disallow	/recherche?list
Disallow	/recherche?key_i
Disallow	/medias?list
Disallow	/about?list
Disallow	/search?f
Disallow	/search?key_i
Disallow	/Search/
Disallow	/Search?f
Disallow	/salle-de-presse?list
Disallow	/Search?key_i
Disallow	/recherche?f
Disallow	/newsroom?f
Disallow	/issues-research?f
Disallow	/pubs
Disallow	/workshops
Disallow	/enjeux-recherche?f
Disallow	/salle-de-presse?f
Disallow	/evenements?f
Disallow	/search-workshops
Disallow	/user/register/
Disallow	/user?
Disallow	/user/password/
Disallow	/user/login/
Disallow	/user/logout/
Disallow	/?q=admin%2F
Disallow	/?q=comment%2Freply%2F
Disallow	/?q=filter%2Ftips%2F
Disallow	/?q=node%2Fadd%2F
Disallow	/?q=search%2F
Disallow	/?q=user%2Fpassword%2F
Disallow	/?path=
Disallow	/?q=user%2Fregister%2F
Disallow	/?q=user%2Flogin%2F
Disallow	/?q=user%2Flogout%2F
Disallow	/?key_i

Rule

Path

Disallow

/collective-agreement/

Disallow

/site-section/

Disallow

/includes/

Disallow

/misc/

Disallow

/modules/

Disallow

/profiles/

Disallow

/scripts/

Disallow

/themes/

Disallow

/CHANGELOG.txt

Disallow

/cron.php

Disallow

/INSTALL.mysql.txt

Disallow

/INSTALL.pgsql.txt

Disallow

/INSTALL.sqlite.txt

Disallow

/install.php

Disallow

/INSTALL.txt

Disallow

/LICENSE.txt

Disallow

/MAINTAINERS.txt

Disallow

/update.php

Disallow

/UPGRADE.txt

Disallow

/xmlrpc.php

Disallow

/sites/cupe/libraries/

Disallow

/sites/cupe/themes/

Disallow

/admin/

Disallow

/comment/reply/

Disallow

/filter/tips/

Disallow

/node/add/

Disallow

/forward

Disallow

/search

Disallow

/search/

Disallow

/recherche

Disallow

/recherche/

Disallow

/recherche?list

Disallow

/recherche?key_i

Disallow

/medias?list

Disallow

/about?list

Disallow

/search?f

Disallow

/search?key_i

Disallow

/Search/

Disallow

/Search?f

Disallow

/salle-de-presse?list

Disallow

/Search?key_i

Disallow

/recherche?f

Disallow

/newsroom?f

Disallow

/issues-research?f

Disallow

/pubs

Disallow

/workshops

Disallow

/enjeux-recherche?f

Disallow

/salle-de-presse?f

Disallow

/evenements?f

Disallow

/search-workshops

Disallow

/user/register/

Disallow

/user?

Disallow

/user/password/

Disallow

/user/login/

Disallow

/user/logout/

Disallow

/?q=admin%2F

Disallow

/?q=comment%2Freply%2F

Disallow

/?q=filter%2Ftips%2F

Disallow

/?q=node%2Fadd%2F

Disallow

/?q=search%2F

Disallow

/?q=user%2Fpassword%2F

Disallow

/?path=

Disallow

/?q=user%2Fregister%2F

Disallow

/?q=user%2Flogin%2F

Disallow

/?q=user%2Flogout%2F

Disallow

/*?*key_i

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

10

Back to top

Other Records

Field	Value
sitemap	http://scfp.ca/sitemap.xml

Field

Value

sitemap

http://scfp.ca/sitemap.xml

Back to top

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/wc/robots.html
For syntax checking, see:
http://www.sxw.org.uk/computing/robots/check.html
Disallow collective agreements
Disallow site section search
Directories
Files
Paths (clean URLs)
Paths (no clean URLs)

Back to top

scfp.carobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

Other Records

Other Records

Comments

scfp.ca
robots.txt