OpenAI Scans for Honeypots. Artificially Malicious? Action Abuse?, (Thu, Aug 22nd)

Resident Pulser@infosec.pub · 2 months ago

OpenAI Scans for Honeypots. Artificially Malicious? Action Abuse?, (Thu, Aug 22nd)

drkt@lemmy.dbzer0.com · 2 months ago

I have a robots.txt which does nothing but exclude “GPTBot”

I was coincidentally looking at my logs for unrelated reasons and caught it reading my robots.txt… and then promptly ignored it and scraped my whole site. Like yeah okay cool man

aviation_hydrated@infosec.pub · 2 months ago

How does Reddit block users? Just by headers and IP addresses? Could the same be done once the GPTbot headers are known?

drkt@lemmy.dbzer0.com · edit-2 2 months ago

Every bit of information being sent to your web server can be spoofed. There is nothing you can do about this unless you’re willing to exclude an increasing percentage of real users.

My webserver is constantly barraged by crawlers and bots because I have zero defenses. I’ve considered intercepting the obvious ones, like the ones targeting wordpress plugins. I don’t use wordpress. I could serve them a 200 instead of a 404 and hopefully waste a real humans time if they check the hits manually.

GBU_28@lemm.ee · 2 months ago

Header spoofing is scraping 101

OpenAI Scans for Honeypots. Artificially Malicious&#x3f; Action Abuse&#x3f;, (Thu, Aug 22nd)

OpenAI Scans for Honeypots. Artificially Malicious&#x3f; Action Abuse&#x3f;, (Thu, Aug 22nd)

OpenAI Scans for Honeypots. Artificially Malicious? Action Abuse? - SANS Internet Storm Center

OpenAI Scans for Honeypots. Artificially Malicious? Action Abuse?, (Thu, Aug 22nd)

OpenAI Scans for Honeypots. Artificially Malicious? Action Abuse?, (Thu, Aug 22nd)