Scraping an Infinite Scroll page when access is denied to the Request URL
The following retailer product page has an infinite scroll which requests data from a URL I am unable to access:
https://www.aldoshoes.com/uk/en_UK/men/holiday-party-style
Once "View All" is clicked, the browser's developer tools show a request to the following URL, which fails to return any product data and gives an "unsupported brand" error:
https://www.aldoshoes.com/api/products/category/72050?currentPage=1&filters=&lang=en_UK&maxFilters=6&pageSize=17&region=uk&sort=relevance
I would like to know if it is possible to access all of the product data through Scrapy without the need for Selenium.
Thank you.
scrapy
edited Nov 11 at 4:45
asked Nov 11 at 4:39
Adam Smith 86
1 Answer
accepted
If you inspect the HTTP request that is made when you click "View All" in your browser's developer tools, you will notice that the request headers include some application-specific headers:
x-aldo-region: uk
x-aldo-lang: en_UK
x-aldo-brand: aldoshoes
x-forwarded-akamai-edgescape: undefined
x-aldo-ssr-request-id:
x-aldo-api-version: 2
Try to include these headers in your request.
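As a rough illustration of the suggestion above, here is a minimal sketch that rebuilds the API URL from the question and attaches the listed headers. It uses only the standard library so it can be run as-is; the names `API_HEADERS`, `build_api_url` and `build_request` are made up for this example, and leaving out the two headers that the browser sent as "undefined" or empty is an assumption, not something confirmed for this site.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Endpoint taken from the URL in the question.
API_BASE = "https://www.aldoshoes.com/api/products/category/72050"

# Headers copied from the list above. The two headers that were "undefined"
# or empty in the browser are omitted here; that they are optional is an
# assumption and may need checking against the live site.
API_HEADERS = {
    "x-aldo-region": "uk",
    "x-aldo-lang": "en_UK",
    "x-aldo-brand": "aldoshoes",
    "x-aldo-api-version": "2",
}


def build_api_url(page, page_size=17):
    """Return the category API URL for one page of results."""
    params = {
        "currentPage": page,
        "filters": "",
        "lang": "en_UK",
        "maxFilters": 6,
        "pageSize": page_size,
        "region": "uk",
        "sort": "relevance",
    }
    return f"{API_BASE}?{urlencode(params)}"


def build_request(page):
    """Build (but do not send) a request carrying the application headers."""
    return Request(build_api_url(page), headers=API_HEADERS)


if __name__ == "__main__":
    req = build_request(1)
    print(req.full_url)
    print(req.header_items())
```

In a Scrapy spider the same URL and dictionary could be passed along the lines of `scrapy.Request(build_api_url(page), headers=API_HEADERS, callback=self.parse)`, incrementing `page` until the response contains no more products. The shape of the response is not shown in this thread, so the stopping condition would have to be worked out from the actual data.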
Wow, fantastic, thank you!
– Adam Smith 86
Nov 12 at 1:59
answered Nov 11 at 8:18
Guillaume