![]() |
Bots flooding GC
All the downtime today (August 26) was due to bots, specifically AI data scraper bots as far as I can tell but might also be search engine related.
They were sending far too many bots way too fast and it was making GC unusable. Basically caused an unintentional DoS (denial of service) "attack" on GC. So I turned GC off while getting that all sorted out. The IP addresses were from a major Internet company (hundreds or maybe even thousands of IPs) and were being used by a different major Internet company. TikTok / Bytedance apparently must be really quite interested in the conversations going on here at GreekChat! |
Must be due to #bamarush!
|
Can they be blocked?
|
Is that what happens? The more interesting the site, the more bots that show? I don’t understand what they collect and the importance of what they collect. And then where does what they collect go?
|
Quote:
I rerouted these particular bots in a way which should minimize their impact on GC. Similar to what I did with the bots from approx 2 weeks ago. Quote:
GoogleBot is a good example of a beneficial bot as that bot indexes sites & helps increase listings in search which might result in more new website visitors. Quote:
They might be training AI / artificial intelligence chatbots using conversations that they scrape from forums all throughout the Internet. Around a decade ago the world entered a new era of AI. Way back in the day one way AI was made was with programmers creating decision trees to determine what to do depending on different scenarios and that would be many levels deep. Modern AI doesn't do that. Instead, modern AI gives the engine, the neural network, as much labeled data as possible which the AI trains on in order to recognize patterns within the data. The AI is trained & tested over and over, repeatedly. Constantly refining, tweaking the neural net while continually improving accuracy. AI recognizing patterns in images is a good example of this type of training. With GC being text based conversations, AI data scraping on that is likely to be for training AI language models. Basically very sophisticated chatbots. Similar to ChatGPT if you're familiar with that. Huge amounts of data, as much as possible, is used in training these AIs. Quote:
|
Based on your post, John, it sounds like bots will be the reason for a future AI takeover. I mean, with the information they’re getting, it’s eventually going to be used for AI in everything. That’s the way it reads.
|
Quote:
But it does need lots of data for training, so bots are a part of it. Modern AI is such a revolutionary advancement that any businesses (or governments) that can benefit from it need to adopt it or eventually they will be surpassed. The disruption will be similar, probably more significant, to when the Internet started being commercialized and ecommerce was disrupting businesses in all different industries. |
Quote:
|
Quote:
Quote:
Quote:
I guess in comparing possible end results... AI could make life significantly better, more productive, more advancement, more efficient, more possibilities, etc. But future AI could also result in disaster, or maybe be used by people in order to cause a disaster scenario. So from that perspective, maybe AI will be more problematic because of the potential for serious harm. But that cat is out of the bag, no going back in. Even if it's regulated it's probably too late. Those who are regulated from developing new AI will simply be those who lose out to the people who take advantage of the technology. Consider Google's AI AlphaGo beating world master champions in the game of Go. AlphaGo was trained based on actual games between people. Then Google AI devs went and made AlphaGo Zero which was given the rules of Go and trained itself, basically played itself for a period of time and learned to master the game that way. AlphaGo Zero defeated AlphaGo significantly. I think at first it was just by a relatively small margin, then after some changes AlphaGo Zero won 100% against AlphaGo. I don't know the exact history of it, but it was something like that. So that was a game that an AI mastered versus people and another AI completely defeated the first AI. Consider that the military has war "games" for training & analyzing scenarios, etc. The US military not only should be using AI, they must be. They have to. Because if they are not and an enemy or future potential enemy develops AI that can defeat all the human military war game "players," like the AIs in the actual game Go, the side that does not have that advantage probably loses. And it might be that the government which reaches AI superiority first is the one that wins it all. Then there are governments that will use massive data collection & AI training/development against their own people. Lots of great benefits but also lots of very serious potential problems. |
^^^ Oh wow! I didn’t look at it from any of the perspectives you’re looking at it from, John. So true, and you make a lot of sense. I think greed will be the downfall of it, though. I mean, for like the other side of the benefits. The bad side. It’s just that a lot of companies now launch AI teams because they’re afraid of falling behind other companies/competitors, without fully knowing where or for what purpose they’ll use AI. And then too, a lot of companies pretend to use AI when they don’t, just to increase their chances of obtaining funding. That’s the greed part. That’s the part I think will get worse. There’s like also a fair amount of general confusion about what AI can and can’t do.
What’s interesting though is that we now use it a lot, daily, sometimes without even realizing it. Do you think it can or will get out of control? I mean, right now AI is completely under human control, but in the future, it might not be under our control anymore. Seems like eventually every single task is going to be done by AI. |
Quote:
In some sense, there have been AIs which temporarily got out of control until they were reigned in. That would be some computer viruses. Although they aren't modern AI. Eventually there will be modern AI based computer viruses, though. With that in mind, there will be AI trained to hack computers. I'd be really surprised if that hasn't already been accomplished. If it hasn't or at the very least if it isn't being worked on & developed then many people in the computer security side of government (and cyber warfare) need to lose their jobs. Hacking is like solving puzzles... puzzles are games, basically & AI has already mastered many games, so why not hacking. Problem is that if we don't do this what if an enemy does. Train the AI to hack, train the AI to find new vulnerabilities, train the AI to infect, to evade detection, stuff like that and we have a real problem, especially if integrated into a virus. Then there will be AI trained to protect against AI that is trained to hack. And who knows which side will win that battle. But it doesn't even need to be an AI designed to do bad things for it to get out of control. Could be a completely well intentioned AI but some aspect of the logic was missed or not protected and the AI determines a solution to whatever problem it is solving is to do something really terrible. It all seems like science fiction but we probably aren't too far off from having self driving cars that are safer than people driving. AI controlling things that have the potential to do a lot of damage becomes more and more real over time. |
This is interesting, John. I remember when I was a flight attendant, our airplanes had auto-land, so they could land themselves. I mean, the pilots still had to make a few adjustments, but not many. Human error is what generally causes plane crashes, when they rarely happen. I’m sure the same would be true with self-driving cars and trucks.
I hope a massive cyber attack where it affects how we live never happens. There are so many things we take for granted that can be wiped out by cyber attack from an enemy. |
Quote:
Quote:
Self driving cars are based on modern AI tech which is quite different in how it functions. I don't know if self driving cars are yet statistically safer than people driving. But over time I'm fairly sure that self driving AI will approach then eventually surpass the ability of people to drive safely. Quote:
One example is the Colonial Pipeline crypto ransomware attack in 2021. That cyber attack had an almost immediate significant impact, but the company was able to resolve fairly quickly. When that pipeline was shut down it impacted most of the US East Coast and several States declared States of Emergency due to it. The day after that cyber attack the company paid $5 million in ransom and luckily the hackers provided the fix after getting their ransom. There have been many companies and different levels of government that have had to pay ransom to hackers to recover essential systems. I recall reading of a number of companies that went bankrupt after not being able to recover from cyber attacks. Quote:
|
Quote:
|
Quote:
Quote:
|
Quote:
Quote:
|
Quote:
Then they began extortion scams- so they'd say to pay the ransom to get your data back AND to prevent them from exposing the data they had on the dark web. It's difficult to get indicted/convicted because most of this is happening from outside the US. And a lot of it is nation state. NOW they have gone a step further and offering protection services from other scammers if a company will pay them monthly- very much like organized crime rings operate. We'll give you your data back, we'll keep it off the dark web and we'll give you protection. This is why I have job security... |
Quote:
|
Quote:
|
Quote:
There's also an interesting interview by one of the agents involved here: https://www.youtube.com/watch?v=4KiO8GRgwDk I haven't watched that entire interview but there are clips floating around. Some parts I did watch included where he talked about only the hackers who make mistakes get caught and/or something about catching the low hanging fruit, but the others have not been caught. Something like that. There's another hacker story where the guy was caught & in jail, working on his own case. Discovered that the gov used some device to masquerade as a cell phone tower, they drove around something like a 2 square mile area collecting call data from a huge number of people. I think it may have been 100,000+ or so. I may be a little off on these facts as it's been a while since I learned about this stuff. That hacker I believe got off because the gov did that mass data collection, trying to find the hacker, without a warrant or something like that. The hacker or maybe his lawyer just happened to stumble upon the name of the device out of thousands of pages of documents. Quote:
Quote:
Quote:
Quote:
|
⬆️ John, this is really crazy. The video is long but still worth watching. I wonder if there’s ever a single hacker. I mean, someone that doesn’t work with a group of hackers. If I was a thief, I’d just do it once and live off of the first millions of dollars, and be done, lol.
But seriously, if I was that computer literate, I would just find a really high paying tech job. I don’t know why they just won’t do that, instead of risking their freedom. Being that intelligent, you could really negotiate what you want. Have you ever seen the move “The Core”? There was a hacker in the movie who got caught, but then the government forced him to work for them or go to prison. It was kind of a cheesy movie, but some parts were ok. Quote:
|
Quote:
Quote:
Quote:
In that YouTube interview the agent talks about catching one of the silk road guy's accomplices. I think they made that hacker work for them for nearly a year or so. |
Somewhat related but a massive spam bot actually took over Greek rank, specifically the SMU page. It’s been nuts to see hundreds of pages of the same spam over and over again. I assume it has to be using a program to spam that hard.
|
We had one about 10 years ago that was Indian astrologer spam. It was awful.
|
Quote:
Quote:
He was surviving off of hot pockets in the movie “The Core”. I watched about 45 minutes of that video. I’m going to continue watching it this weekend. |
Quote:
There are MS Windows based programs that people run on their computers even just for spamming forums. Then programs on servers. And bot nets that run through virus infected computers/servers & are controlled remotely. There could even be people here reading this whose computer is part of a bot net and they don't even know it. Quote:
In addition to the Indian astrology spam I recall some major fashion brand spamming GC for a long time as well. There was also a celebrity who must have hired spammers to promote her new show... I emailed them and told them to just sponsor GC to promote their show instead of spamming us. Then my email... one account was receiving 8000+ emails a day when I decided to shut it down. The spam on the forums and email was a real nightmare to deal with. May have been impossible to handle all that without the help of many site moderators. GC also wasn't my only forum site. The other forums I shut down new account registrations on years ago. Was just too overwhelming to continue dealing with all the spam / spambots littering the sites with spam from every different direction. On GC I made it difficult to easily create/verify new accounts and also use a crowd sourced spam detection service, which is quite imperfect and probably has too many false positives. That service has been used to block something like 10 million + account registrations on GC. I often wonder, though, about their false positive rate. Even 1% would be a huge loss. At some point I'll get around to looking into that more and will post additional details. Quote:
|
There are definitely warehouses of people in other countries who are scamming people as their full time jobs. They go to work, catfish people in romance scams, etc. The vast majority of hackers attacking us here are not in this country.
|
How can scammers live with themselves--cheating people? What would their mothers think if they knew what they're really doing?
|
Quote:
|
Bots are flooding GC again. Nearly 6,500 online users right now...
There are always bots, search spider bots are fine & not a problem. This current flood, though, is probably the AI scraper (data harvesting) type again. Will try to get things sorted out with this batch of bots before they cause too much trouble... |
Quote:
I thought you made it where it’d be harder for bots to attack the site. Did they manage to get around it? |
The bots that were causing trouble today were sorted out earlier this evening. Highest I saw peak online hit today was just under 8,000.
Quote:
Quote:
Quote:
Not sure yet but I've been thinking about spending more time in May and/or June on GC projects again, maybe improving bot mitigation will be included if I do that. |
I noticed that the majority of them were looking at the same thread.
|
Quote:
Last summer was a similar issue. Bots from 2 IP addresses loading all the pages from the print version of the original "What do you feel like saying, right now?" thread. Those bots were loading something like 50 to 100 million pages per month from that thread. They were a major contributing factor causing the site to slow down so much for a while up until then. The bot flood back in the fall I think was mostly scattered across many different pages. |
Quote:
|
We've hit a new record of bots on GC (unfortunately).
From the Online Users page: "Most users ever online was 23274, Today at 08:10 AM." That stat is over a 15 minute period. So 23,274 "users" from 7:55 to 8:10 AM today. Probably 99% bots. The previous record was 15,051 back in August 2024. Quote:
The bots causing the massive slowdowns on GC leading up to last summer were originating from two specific IP addressed at a cloud web services provider. Those IPs I redirected in a way where it would not be clearly obvious that they were redirected and no longer on GC. Since then I've redirected many other bots/IPs in a similar way. Most seem to be originating from China, Singapore and now Brazil. Not sure yet how I'll be managing this most recent onslaught of bots. The bulk of them seem to have gone away, but could be back at any moment. |
Thank you so very much, John, for keeping GC up to date with the Battle Against the Bots!
|
Quote:
John, where’s the coffee thread? I want to buy more coffee. You work so hard to keep this site going and I just want to thank you in the best way I possibly can. |
Quote:
"Battle Against the Bots" for some reason reminded me of a TV show I used to watch some years ago called BattleBots: https://battlebots.com/ I used to love that show. Maybe I'll start watching again and when they're bashing each other to pieces I'll imagine they are the internet bots that were attacking GC. ;) Quote:
Sometimes the top level domain country code is in the IPs reverse DNS as well. However, there's no guarantee that the actual source is from those countries. Could be they are just renting servers from datacenters / cloud server providers in those countries or those IPs could be from some sort of bulk proxy / VPN type services with IP addresses assigned to organizations in those countries. I suppose it may also be possible that those IPs are registered to orgs in those countries but being used somewhere else. Quote:
The original "Buy GreekChat a Coffee" thread is at https://greekchat.com/gcforums/showthread.php?t=248704 And the direct link to the Buy Me A Coffee page for GC is https://buymeacoffee.com/greekchat |
Quote:
|
All times are GMT -4. The time now is 09:11 AM. |
Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2025, vBulletin Solutions Inc.