Reality Check: Automated Shopping Bots are a Business Problem

Last week, I had the pleasure of participating in a webinar on automated shopping bots with Sandy Carielli, Security and Risk Analyst at Forrester Research. The webinar highlighted two things for me: automated shopping bots are a complex problem and th… Continue reading Reality Check: Automated Shopping Bots are a Business Problem

Job Application Script Automates The Boring Stuff With Python

Job hunting can certainly require a good amount of hoop-jumping in today’s age. Even if you’re lucky enough to have your application read by an actual human, there’s no guarantee the person on the other end has much of an understanding about your skill set. Oftentimes, the entire procedure is …read more

Continue reading Job Application Script Automates The Boring Stuff With Python

DC Court Ruling Reduces Webscraping Risk

In a decision that reduces some risk associated with webscraping, the United States District Court for the District of Columbia ruled that violating a website’s terms of service cannot alone be the basis for a finding that the conduct is “… Continue reading DC Court Ruling Reduces Webscraping Risk

Facebook suit accuses two Ukrainians of distributing adware disguised as quizzes

Facebook has accused two Ukrainian men of using quiz apps on the social media platform to inject malicious software on people’s computers, according to a lawsuit first noticed by the Daily Beast. By installing software extensions that masqueraded as Facebook quizzes, users unwittingly allowed the two men to inject advertisements into their news feeds and access their lists of friends, according to the lawsuit. That information then was exfiltrated to servers outside the country. The two men, Andrey Gorbahov and Gleb Sluchevsky, are Kiev-based entrepreneurs affiliated with a company called the Web Sun Group. The company did not respond to a request for comment from the Daily Beast Friday, and its website appeared to be down by Monday. “In total, Defendants compromised approximately 63,000 browsers used by Facebook users and caused over $75,000 in damages to Facebook,” the company claims in the lawsuit. The activity lasted from 2016 until October 2018 and primarily […]

The post Facebook suit accuses two Ukrainians of distributing adware disguised as quizzes appeared first on CyberScoop.

Continue reading Facebook suit accuses two Ukrainians of distributing adware disguised as quizzes

splashr 0.6.0 Now Uses the CRAN-nascent stevedore Package for Docker Orchestration

The splashr package [srht|GL|GH] — an alternative to Selenium for javascript-enabled/browser-emulated web scraping — is now at version 0.6.0 (still in dev-mode but on its way to CRAN in the next 14 days). The major change from ver… Continue reading splashr 0.6.0 Now Uses the CRAN-nascent stevedore Package for Docker Orchestration

‘data:’ Scraping & Chart Reproduction : Arrows of Environmental Destruction

Today’s RSS feeds picked up this article by Marianne Sullivan, Chris Sellers, Leif Fredrickson, and Sarah Lamdanon on the woeful state of enforcement actions by the U.S. Environmental Protection Agency (EPA). While there has definitely been overr… Continue reading ‘data:’ Scraping & Chart Reproduction : Arrows of Environmental Destruction

More “Scraping Ethics Gone Awry” and “Why Do This When There’s a Free API?”

I can’t seem to free my infrequently-viewed email inbox from “you might like!” notices by the content-lock-in site Medium. This one made it to the iOS notification screen (otherwise I’d’ve been blissfully unaware of it and… Continue reading More “Scraping Ethics Gone Awry” and “Why Do This When There’s a Free API?”

Introducing ‘gepetto’ — a Splash-like REST API to Headless Chrome

It’s been over a year since Headless Chrome was introduced and it has matured greatly over that time and has acquired a pretty large user base. The TLDR on it is that you can now use Chrome as you would any command-line interface (CLI) program an… Continue reading Introducing ‘gepetto’ — a Splash-like REST API to Headless Chrome

In-brief: splashr update + High Performance Scraping with splashr, furrr & TeamHG-Memex’s Aquarium

The development version of splashr now support authenticated connections to Splash API instances. Just specify user and pass on the initial splashr::splash() call to use your scraping setup a bit more safely. For those not familiar with splashr and/or … Continue reading In-brief: splashr update + High Performance Scraping with splashr, furrr & TeamHG-Memex’s Aquarium