The Wayback Machine is Deleting Evidence of Malware Sold To Stalkers (vice.com)
The Internet Archive's Wayback Machine is a service that preserves web pages. But the site has been deleting evidence of companies selling malware to illegally spy on spouses, Motherboard reported Tuesday. From the report: The company in question is FlexiSpy, a Thailand-based firm which offers desktop and mobile malware. The spyware can intercept phone calls, remotely turn on a device's microphone and camera, steal emails and social media messages, as well as track a target's GPS location. Previously, pages from FlexiSpy's website saved to the Wayback Machine showed a customer survey, with over 50 percent of respondents saying they were interested in a spy phone product because they believe their partner may be cheating. That particular graphic was mentioned in a recent New York Times piece on the consumer spyware market.
In another example, a Wayback Machine archive of FlexiSpy's homepage showed one of the company's catchphrases: "Many spouses cheat. They all use cell phones. Their cell phone will tell you what they won't." Now, those pages are no longer on the Wayback Machine. Instead, when trying to view seemingly any page from FlexiSpy's domain on the archiving service, the page reads "This URL has been excluded from the Wayback Machine."
In another example, a Wayback Machine archive of FlexiSpy's homepage showed one of the company's catchphrases: "Many spouses cheat. They all use cell phones. Their cell phone will tell you what they won't." Now, those pages are no longer on the Wayback Machine. Instead, when trying to view seemingly any page from FlexiSpy's domain on the archiving service, the page reads "This URL has been excluded from the Wayback Machine."
The Wayback Machine obeys robots.txt, even retroactively. If a site puts up a robots.txt file, archive.org will remove old versions of the site.
See https://archive.org/about/faqs...
If you want to delete your site from the wayback machine, all you have to do is ask them. They are not obligated to keep any page in the archive, whether it contains "evidence" or not. You can also exclude ia_archiver user agent in your robots.txt, which will prevent your site from being indexed in the first place. This way you will not even have to ask them.
It wasnt malware, in the American language it would be called something like a "analytics's and management platform, with realtime reporting and active asset monitoring and protection"
It is very annoying, but that's how it works. The worst is when a site that is owned by an entity who goes out of business is preserved by the wayback machine, but then another entity gets the domain, puts up a robots.txt and there goes all the history.
For all the good it is doing, it would be so much better if it did not apply robots.txt retroactively. It doesn't even make sense, robots.txt says "bots stay out", which is not nearly the same as "bots, forget whatever you had visited in the past"...