Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[REMOVE] "(?:\b)facebookexternalhit(?:\b)" good_bot #208

Open
doxycomp opened this issue Jul 28, 2024 · 1 comment
Open

[REMOVE] "(?:\b)facebookexternalhit(?:\b)" good_bot #208

doxycomp opened this issue Jul 28, 2024 · 1 comment
Assignees

Comments

@doxycomp
Copy link

Removal Request?

Please List the User-Agent string or Referrer to be added/removed

/globalblacklist.conf:BrowserMatchNoCase "(?:\b)facebookexternalhit(?:\b)" good_bot

Facebook is not a good bot and ignoring robots.txt and spamming the server with requests.

For Additions: Please include a log sample 3-5 lines is adequate

It defaults to good_bot in globalblacklist.conf and seemingly cannot be set as bad_bot in blacklist-user-agents.conf

Thank you!

@doxycomp
Copy link
Author

Just in case, sombody else is struggling with this, for the moment we added some lines to the top of our .htaccess, to block the facebook/meta crawlers:

RewriteCond %{HTTP_USER_AGENT} ^facebookexternalhit.* [OR]
RewriteCond %{HTTP_USER_AGENT} ^meta-externalagent.*
RewriteRule .* - [F,L]

Kind regards :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants