-
Notifications
You must be signed in to change notification settings - Fork 53
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Public scraping methods fail due to login screen on Instagram on production builds #24
Comments
This is odd. |
Thanks for the quick reply but it still not work... |
It works for me. I am not sure what is happening locally for you. Are you able to go on instagram? was it working for you and just broke or never worked? |
I can go to instagram in the browser. But still not work. Error message like these: warning The gatsby-source-instagram plugin has generated no Gatsby nodes. Do you need it? My location is China, it must be the issue with GFW...I tried use proxy in the terminal, then npm run develop, but still wait for a long time then failed...😭 |
:( I am really sorry but i am not sure how i can help here :( You will need to try with a vpn or something. |
@Jexxie Instagram maybe blocked in China according to the "law", so you may need to find other way. This should not be an problem of this plugin. |
Closing this! feel free to reopen if you think there's something to be done in the scope of the plugin! |
I am having trouble with this same description when I try to build my Gatsby app on AWS but when I try locally somehow it works. I thought I solved the problem by updating npm packages and I had 2 successful builds but today it is not building. Any help on how to solve this problem?
|
Now I am intrigued. I've just redeployed my app and it worked. Any idea why this seems to randomly fail? I wouldn't like to have a few failed builds hoping the next one will be the one every time I need to deploy. |
Hey there, kind of hard to know without reproduction :/ Is it with public scraping or with the graph api? |
Hi. Yes it's running on amplify and with public scrapping, with just an username added to the plugin config parameters. GraphQL query looks like this:
` |
Yeah query isn't the issue. the error you showed above is when it couldn't get the instagram posts. maybe something wrong with instagram during that particular time? does it still happening? I just triggered another rebuild on netlify for the example app and seems to be working fine |
I seem to have the same issue. Just a week ago it seemed to have worked fine.
|
I am seeing this now on netlify as well. locally works fine. I am investigating right now |
Well now it went through on netlify. I think maybe network issue? I can't find any way to reproduce locally so not sure what i can do here :/ |
I am really not sure what is happening here, but it seems like this is not the plugins fault? I deployed the same built a couple times on Netlify every time clearing cache before building. The first two times everything works fine and then the 3rd time it breaks.
Btw, I am not using an Instagram API, maybe this makes a difference? |
I'm having the same problem on Netlify, seems an issue related to the platform, on development it works. |
Anything we can do about it? Some workaround? |
Well the issue is not with the plugin afaict. it is happening when scraping the page doesn't work. But i can't find out whether this is an issue on Netlify (much more likely since it never happens in dev) itself or if Instagram is testing things out. After all the public scraping is extemely dependent on their raw html code :) Only thing i can susggest if use the API. it worked the whole time i was testing. Has anybody tried to contact Netlify support to see if anything is off in the network? |
I am re-opening this for visibility among folks checking the issues. |
Same problem happening here. I tried a couple of times today and the build is failing. is there any workaround for this? @oorestisime thanks for all your replies :) |
I'm having the same issue although my builds fail randomly and hitting redeploy helps. This is the error message:
|
@oorestisime thanks for reopening this issue! I have posted this on the Netlify community as well. |
I've also been running into this issue. Noticed a string of failed builds on the deploy dashboard with the same error. I hope your Netlify post brings attention to the problem! |
If you could join the netlify post and let people know you have had the same issue, maybe it will gain more attention. Would really appreciate it! Thanks so much here already for all the help! |
@samason can you share some code example on what you've done? Can you fetch instagram as not authenticated user?
I think I'll have a go, today my buids start to fail again even if I was using the id method |
Ok just in case someone want to follow this path this is my attempt to fetch all the instagram phost from the client
|
I encountered this issue as well last year. And I don't think it's this plugin's fault, but rather down to Instagram doing weird stuff. In the meantime, I also went with fetching the posts manually and wrote a blog post about how to do so. |
Hey @LarsBehrenberg, thanks for sharing! The method you describe is using the same end-point as the current Public scraping for posts:
I see that it is working in the demo. But I wonder if it would also suffer from the same "rate-limiting" login wall that the we've been hitting lately. In my use-case, for an artist's representation agency, I am scrapping the posts of 60+ Instagram accounts and after a few builds within a short time-frame we face the login wall. I wonder if the dynamic component would suffer a similar fate if the page had many visitors and thus many requests to end-point? |
@owenhoskins Good question! That might as well be the case. I haven't run into any issues using this method yet and been using it for the last 8 months or so. So I guess you'll just have to try out and see what happens... Sorry for not being more helpful :S |
@LarsBehrenberg Thanks for your effort and I like the approach. But currently, your demo isn't working any longer – at least for me: |
Note sure, just checked and it's working fine for me. |
I can reproduce these errors by using the same IP address, but it works by changing my IP address via VPN. So is the consumption of that API depending on the end-users rate limit? |
That exactly what happened when I deployed to Netlify. |
Same here @VT-Web-Development! were you able to fix it? Having this issue on multiple websites now.. |
No - I am not using it for now. I will try another solutions. |
Has anyone found a solution to this or a solution in general on how to get Instagram data in a react/gatsby app? |
@joshua-isaac The only thing that works for me is Zapier > Airtable |
But you have to pay for it. |
Hi, just to give a heads up. There is nothing to do here for the plugin. Its on instagram to stop their paywalls i cant do anything :( |
Hi everyone, thanks for this thread. So just to clarify, if we use the Graph API method rather than public scraping are there still issues? If not, does anyone know of any up to date tutorial/guides on how to set up? Instagram/FB seem to want to make it as convoluted as possible... I'm using public scraping during development and it usually works as long as the dev server isn't restarted numerous times (which fits with what everyone is saying). Haven't yet deployed to Netlify though. |
Hey there, I found out that my access token was no longer valid. After creating a new version via the steps explained here: https://www.gatsbyjs.com/plugins/gatsby-source-instagram/#instagram-graph-api-token, the public scraping worked again. You can check out your token here: https://developers.facebook.com/tools/debug/accesstoken/. |
hey @tijsluitse thanks for the links! I was facing this issue and I followed that procedure and was able to get an access token. Though unfortunately I wasn't able to get a token that never expires :/. Simply when I click "debug" for the second time I get told it still expires in three months. The only thing that I had to do different is using v10.0 of the Graph API, as I had no other option given in the Explorer. |
@matteocarpi I have the same thing that I cannot access any other API version, but I cant find the permissions mentioned, were they available for you? |
hey @kanlanc yes, but it was tricky. When you click on "Generate Access Token" and a new window pops up, you should be able to click on "Edit settings" or something similar, and allow the app to access the facebook page connected to the instagram account (if that makes sense). at that point, once you confirm and go back to the API Explorer, you should be able to see all the permissions you need. Hope that helps :) |
same, my clients bothered me recently and I still cannot resolve the issue |
So, public scraping is no longer available...? |
Public scraping will fail more frequently on hosting sites like Netlify and Gatsby Cloud because there are likely many sites doing public scraping on Instagram via those hosts. Instagram then rate limits those host servers by returning a login page, which causes most builds using public scraping to fail on those hosts. |
Closing this as i dont think we can do anything more here! |
I use this plugin in my demo and could not work, it says could not fetch instagram posts, no Gatsby nodes generated(I did't use any token and just want to Public scraping for posts). So I cloned your repo, cd into /example, yarn install and run 'npm run develop'. And then it seems like the same problem happened. Wonder to know if i am missing something? Thanks!
The text was updated successfully, but these errors were encountered: