Commit Graph

18 Commits

Author SHA1 Message Date
91a9a47f00 Added another selector for price on amazon (does not work for books) 2021-05-18 23:18:15 +02:00
63cbac5490 BETTERZON-58: Fixing API endpoint of the crawler
- The list of products in the API request was treated as a string, so only the first product was crawled
2021-05-17 17:53:20 +02:00
73effffc89 BETTERZON-58: Adding access key verification 2021-05-17 17:32:52 +02:00
c8d37d60f8 BETTERZON-58: Fixing SQL insert 2021-05-17 17:25:01 +02:00
f98d1fdb24 Fixed string concatenation for sql statement in getProductLinksForProduct 2021-05-16 23:48:13 +02:00
776b9a00f2 Connected Api to Crawler 2021-05-16 23:16:57 +02:00
2067a47fb2 Added basic amazon crawler using beautifulsoup4 2021-05-16 22:05:32 +02:00
dbc793cc08 moved scrapy files to unused folder 2021-05-16 21:12:48 +02:00
f1d6487701 . 2021-05-16 15:48:08 +02:00
0a11b2b453 moved logic to amazon.py 2021-05-16 15:41:39 +02:00
root d2a4d93f54 Added independent crawler function, yielding price 2021-05-06 00:09:30 +02:00
8e58efa42c BETTERZON-58: Basic Functionality with scrapy 2021-04-28 22:20:15 +02:00
Patrick 610808ad03 BETTERZON-59: Adding crawler basic framework (#29) 2021-04-14 21:51:36 +02:00
Patrick f5fd1825d7 BETTERZON-56: Adding crawler load-balancing script (#28) 2021-04-14 18:52:22 +02:00
fafacdd942 BETTERZON-57: Adding utility sql functions 2021-04-13 21:10:02 +02:00
55a019d217 BETTERZON-49 (#24)
* BETTERZON-49: Creating Dockerfile

* BETTERZON-49: Added minimal Flask API as Docker container

Co-authored-by: Patrick Müller <patrick@mueller-patrick.tech>
2021-04-07 23:34:08 +02:00
8055f811d7 BETTERZON-49: Adding module .iml 2021-04-01 11:38:03 +02:00
05d4795f9d BETTERZON-49: Creating Module for Crawler 2021-04-01 10:34:35 +02:00