Scraping bad

[Lestrades.com] Nao 尚

  • I'm Share-Locked
  • Administrator
  • Posts: 1,396
  • Trades: 223
Scraping bad
« on March 13th, 2017, 03:33 PM »
Anything related to the actual scraping should go in this topic.

What's scraping, by the way..? It's just a word we use for the process of one website's script browsing some other website(s) and 'spidering' (stealing?) their data for their own use. It's what I've been doing on Barter to import user tradables. It's what Barter has been doing to import Steam game data.

At some point in the future I'll switch to importing data from Steam directly, rather than using Barter.
This will have a few advantages:
- Will get more up-to-date data for recent games.
- I won't have to deal with stupid Barter hiccups.
- I can more logically support adding non-Steam games manually to the database.

But these pale in comparison with the drawbacks:
- Riskier, because Valve may not like it. (Even if I'm going to be as soft as possible, e.g. maybe updating only recent games, older games less frequently, and very old games very rarely, unless a user visits the Steam game page and reports something changed.)
- I have to rewrite the whole damn thing! Damn! Thing! No, thing isn't a curse word.
- I'll have to determine what exactly I'm importing. For instance, Barter imports movies and things like that. Is that REALLY necessary..?! This feels like a waste of time and space to me. If some bundle ends up having a movie key (like Groupees does from time to time), we can add those individually to the database. If someone really wants a movie in their wishlist, err... I don't know what to do. See what I mean when I say I don't know what I should import?
- And yes, what about demos and free games..? Will the games always be free? Even if I clearly indicate on the game page that the game is free and shouldn't be traded, it's still a problem. If a game was previously commercial and is now free, should it be removed from the database, upsetting people who have a copy up for trade? (Some people like collecting those as Steam gifts... I think.)
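
To make the "as soft as possible" idea from the drawbacks list concrete, here's a minimal sketch of an age-based refresh schedule. Everything in it is an assumption for illustration: the function names, the 90-day / 2-year cut-offs and the intervals are invented, since the post only distinguishes "recent", "older" and "very old" games, plus the case where a user reports a change.

```python
from datetime import date, timedelta

# Hypothetical thresholds and intervals: the post gives no actual numbers.
RECENT_MAX_AGE = timedelta(days=90)
OLDER_MAX_AGE = timedelta(days=730)


def refresh_interval(release_date, today, user_reported_change=False):
    """Return how long to wait between two scrapes of the same game."""
    if user_reported_change:
        # A user flagged the Steam page as changed: refresh right away.
        return timedelta(0)
    age = today - release_date
    if age <= RECENT_MAX_AGE:
        return timedelta(days=1)    # recent games: update often
    if age <= OLDER_MAX_AGE:
        return timedelta(days=30)   # older games: less frequently
    return timedelta(days=180)      # very old games: very rarely


def is_due(release_date, last_scraped, today, user_reported_change=False):
    """True if enough time has passed since the last scrape."""
    wait = refresh_interval(release_date, today, user_reported_change)
    return today - last_scraped >= wait
```

With something like this, a game from 2010 scraped two weeks ago would be skipped unless a user reported a change on its page.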

You have 2 hours!
(No, you have more actually. I'm just taking a break from coding and I don't know what to post.)

Akylen

  • Full Member
  • Posts: 218
  • Trades: 61
Re: Scraping bad
« Reply #1, on March 13th, 2017, 05:23 PM »
It would be nice to include everything Steam has that is considered tradable. However, for now that is not needed... games are probably the priority.
I wouldn't include free games either, not at this stage.

[Lestrades.com] Nao 尚

  • I'm Share-Locked
  • Administrator
  • Posts: 1,396
  • Trades: 223
Re: Scraping bad
« Reply #2, on March 13th, 2017, 06:26 PM »
I'm not planning to prevent specific items from being entered; I'm referring to the idea of preventing them from being indexed automatically. There are thousands of games on Steam that will neither interest anyone nor be bundled at any point (e.g. games), and I'm not a fan of the idea of cluttering the database even more with useless data. Barter will eventually support adding any package sub or item (app) not already in the database, it's just that I don't want to import movies 'by default'. See what I mean..?
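
A rough sketch of that distinction, assuming a simple in-memory catalog: the scraper skips some item types, while a manual add accepts anything. The class, the method names and the type labels ("game", "movie") are hypothetical and not taken from any real Steam or Barter API.

```python
# Hypothetical set of item types that get indexed automatically.
AUTO_INDEXED_TYPES = {"game"}


class Catalog:
    def __init__(self):
        self.items = {}  # item id -> record

    def import_scraped(self, item_id, item_type, title):
        """Automatic import: silently skip types not indexed by default."""
        if item_type not in AUTO_INDEXED_TYPES:
            return False
        self.items[item_id] = {"type": item_type, "title": title, "source": "scraper"}
        return True

    def add_manually(self, item_id, item_type, title):
        """Manual add: any type is allowed; existing entries are kept as-is."""
        return self.items.setdefault(
            item_id, {"type": item_type, "title": title, "source": "manual"}
        )
```

So a movie key from a bundle never shows up through `import_scraped`, but can still be entered once through `add_manually` when someone actually needs it.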

Akylen

  • Full Member
  • Posts: 218
  • Trades: 61
Re: Scraping bad
« Reply #3, on March 13th, 2017, 07:34 PM »
Will it still be possible for traders to add the non-indexed items later somehow?

[Lestrades.com] Nao 尚

  • I'm Share-Locked
  • Administrator
  • Posts: 1,396
  • Trades: 223
Re: Scraping bad
« Reply #4, on March 13th, 2017, 11:22 PM »
Yes Akylen, that was even the whole point of my post. :P

@theemu, is it really that important to get +1's...? :-/
I'm a collector, but I don't get the point of 'just' a +1. I collect games I'd like to play, not 'database flags'... Because that's what it all is, ultimately. If Valve gave you their infamous all-encompassing Steam gift (for actually working for them), all you've earned is a flag set to 'owned' on all the games at Steam... It doesn't cost them a dime.

I don't think Barter classified those 4 types in-site. Apart from demos, is there anything that helps spot them?
As I'm sure you're aware, some people actually add demos to their tradables or wishlists, so I'd also have to clean these up.
That GT5 link is weird, it doesn't seem to show up in Hikiri's tradables on Barter.
Do you think there's a point in updating your user script beyond next week, besides nostalgia..?
Hmm yes I guess the title fixes could be fun, but I believe I'm already getting all of my empty titles from the Steam API so it's not really needed.
Technically I'm leaning towards importing game titles from the future Steam Store scraper rather than the Steam API like I'm doing now. I can be convinced otherwise though. I find it so weird that Steam would encourage having different titles for both sources...
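
For illustration only, one way to combine the two title sources would be to prefer the store title and fall back to the API title when the first one is empty. The function name and the order of preference are assumptions based on the "leaning towards" remark above, not a description of the actual code.

```python
def pick_title(store_title, api_title):
    """Prefer the Steam Store title; fall back to the Steam API title.

    Both arguments may be None or empty strings (hypothetical inputs).
    """
    for candidate in (store_title, api_title):
        if candidate and candidate.strip():
            return candidate.strip()
    return None  # neither source had a usable title
```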

PS: beware, I'm currently in sleeping mode. I just finished another small step in my trading system and I'm too tired to even care about the fact that I'm terribly late for the release.

Akylen

  • Full Member
  • Posts: 218
  • Trades: 61

[Lestrades.com] Nao 尚

  • I'm Share-Locked
  • Administrator
  • Posts: 1,396
  • Trades: 223
Re: Scraping bad
« Reply #6, on June 16th, 2017, 04:45 PM »
(Edited your post to fix your broken links.)

It's no bug at all. This developer probably decided, at some point, to stop selling the base game (app), and only sell its deluxe package (sub), with the original app being part of it. So he probably also updated the app's banner to fit that change.
I've already seen this happen a lot. It doesn't change the fact that the base game WAS bundled at least once, and so was the package. I don't know if redeeming the base app from the original bundle would give access to the rest of the sub... I doubt it!