Defered data for download button #5053

Vinno97 · 2022-07-28T12:38:04Z

Problem

The download button currently expects its data to be available when declaring the button. If data needs to be read from disk (or worse: compiled multiple disk sources), this can make the app needlessly slow.
In my app, the data downloading is not a common use case, but the packing of the data for downloading is relatively expensive. Caching helps, but only when the data doesn't change.

Solution

I propose a method to only load and preprocess (archive, pickle, etc) when the download is actually requested.

I propose to also allow a function as a data type that gets called as soon as the download button is pressed. This callback then returns the actual data.

def get_data():
    data = some_heavy_data_loading()
    return data

st.download_button("Download Data", get_data, file_name="my-data.dat")

Possible additions:

Currently a download button accepts str, bytes, TextIO, BinaryIO, or io.RawIOBase. With deferred loading, it would also be possible to accept a file pointer and stream the data to the user. This might bring huge speed and memory benefits when downloading large files.

Technically this streaming would also be possible without deferred loading, but then you're keeping unnecessary files open.

Community voting on feature requests enables the Streamlit team to understand which features are most important to our users.

If you'd like the Streamlit team to prioritize this feature request, please use the 👍 (thumbs up emoji) reaction in response to the initial post.

The text was updated successfully, but these errors were encountered:

lukasmasuch · 2022-07-28T14:47:01Z

@Vinno97 Thanks for the suggestion. This would be indeed a nice addition to the download button, especially when dealing with large files. I will forward this feature request to our product team.

tomgallagher · 2022-07-29T05:36:47Z

In the meantime, I'm using this as a way of ensuring that page flow is not interrupted by large file prep

def customDownloadButton(df):
    if st.button('Prepare downloads'):
        #prep data for downloading
        csv = convert_df(df)
        json_lines = convert_json(df)
        parquet = convert_parquet(df)
        tab1, tab2, tab3 = st.tabs(["Convert to CSV", "Convert to JSON", "Convert to Parquet"])
        with tab1:
            st.download_button('Download', csv, file_name='data.csv')
        with tab2:
            st.download_button('Download', json_lines, file_name='data.json')
        with tab3:
            st.download_button('Download', parquet, file_name='data.parquet')

jrieke · 2022-07-30T00:08:27Z

Yes agree! Back when we implemented download button, I know that we also thought about allowing users to pass a function. Not sure if we cut that just to reduce scope or if there were any reasons against doing that. Will revisit!

xR86 · 2022-08-29T14:55:59Z

I also had this issue, but it appears that it does approximately what you proposed, @Vinno97 ?
The docs mention that you could have a callback for this.

Not sure if I'm missing some nuance with blocking when downloading large files, but I've already used this for data to be generated on click, regardless if it's data files or octet streams to be saved as files (eg: zip).

Lifted from the docs:

@st.cache
 def convert_df(df):
     # IMPORTANT: Cache the conversion to prevent computation on every rerun
     return df.to_csv().encode('utf-8')

csv = convert_df(my_large_df)

st.download_button(
     label="Download data as CSV",
     data=csv,
     file_name='large_df.csv',
     mime='text/csv',
 )

@jrieke Was this functionality added in the meantime and not linked to this issue ?

jrieke · 2022-09-23T22:59:36Z

Nope we didn't implement this yet. We don't have a timeline yet but I'm 99 % sure we want to do this at some point.

amirhessam88 · 2022-12-26T02:44:38Z

Any progress on this ? Do we have an ETA when this bug is gonna be fixed?

wolfgang-koch · 2023-01-09T11:49:14Z

I would appreciate if this gets resolved. I already tried to address this issue on the forum a couple months ago: https://discuss.streamlit.io/t/create-download-file-upon-clicking-a-button/32613
My idea was to solve this using some JS, but it's messy and causes some slight shifting down of the page content.

In my opinion, st.download_button should only fill memory with the file's content upon acutally clicking the button instead of on every script re-run.

jzluo · 2023-01-19T03:11:29Z

I'd also like to voice appreciation this feature. I finally tracked down my app's occasional hanging to this issue. In the meantime, gating the download button behind a "prepare data for download" button like @tomgallagher's example above is a clumsy but okay workaround.

HStep20 · 2023-01-28T16:13:33Z

This would be a great feature. I know its highly requested, but when working with APIs, the lack of this feature makes it a miserable experience. It has to hit the API each time the page is reloaded to prep the download, meaning lots of requests within a quota are used up. Its even worse if you have multiple tabs on a page, each of which download a different dataset for the user - It means x api calls per page load, per tab, each time the script is rerun.

Ive mitigated it by using a nested button like tom suggested, to 'get' data, then show the download button to download it, but a proper way to combine both into one UX Action would be amazing.

masonearles · 2023-03-14T03:41:51Z

+1

ElenaBossolini · 2023-08-12T19:32:53Z

Same problem here. In my case I need to generate a excel file from multiple large pandas dataframes (one dataframe per sheet). I write the data as BytesIO.
The experience is that going from a pandas dataframes to a BytesIO buffer takes about 0.003s, but on the streamlit app, the user is left hanging for multiple seconds. Something between 5s and 10s.

SabraHealthCare · 2023-10-28T12:31:35Z

def get_data():
    data = some_heavy_data_loading()
    return data

st.download_button("Download Data", get_data, file_name="my-data.dat")

def get_data():
st.write("test")
data = some_heavy_data_loading()
return data

I added 'st.write("test")' in get_data, and found that "test"was printed before download_button. it means the get_data() still runs even download button is un-clicked.

andrewpimm · 2023-10-30T10:16:26Z

def get_data():
    data = some_heavy_data_loading()
    return data

st.download_button("Download Data", get_data, file_name="my-data.dat")
def get_data(): st.write("test") data = some_heavy_data_loading() return data

I added 'st.write("test")' in get_data, and found that "test"was printed before download_button. it means the get_data() still runs even download button is un-clicked.

Unless there has been an update that hasn't been announced here, I'm not sure that a function can be called from st.download_button in this way.

jsulopzs · 2023-12-13T09:25:25Z

+1 to this feature, it'd be great for developers to create custom calculators that provide business value and a rich UX.

CharlesFr · 2023-12-17T20:33:04Z

any updates on this feature?

ViniciusgCaetano · 2024-01-06T15:03:52Z

+1

zbjdonald · 2024-01-11T08:13:58Z

any updates on this feature?

LarsHill · 2024-01-13T14:18:12Z

I came across this issue as well. Besides large data payloads being created on every run, it is annoying that there is no way to create the data only after the download button is clicked.
In my case the raw data to be downloaded is created and stored as session state "after" the position of the download button in the code. Now when I click the download button the previously created data state is downloaded but not the current state.

Here is an example:

create_data = st.button("Create data")

if "data" not in st.session_state:
    st.session_state.data = None

st.download_button(
    label="Export",
    data=st.session_state.data,
    file_name=file_name,
)

if create_data:
    # logic to create data here
    st.session_state.data = create_data_logic()

Now, first I click on the "creata data" button and afterwards I click the "download" button but only None is downloaded.
Only on an app rerun the session state available to the download button is updated and the correct data is downloaded.

If the data creation process could happen in a callback after the download button is clicked, there would be no issue...

Currently this workaround does the job for me, but I feel this should be natively possible in strewamlit without js hacks...

anki-code · 2024-02-08T16:03:36Z

Personally I want to say that Streamlit is very unpleasant for new users and I need to google every step and I continuously facing with issues with use cases. And yes, I want to +1 this bug too because when I want to download the data I want to click on the button, wait processing and get the data.

sfc-gh-pkommini · 2024-02-27T01:51:17Z

Hi Team,

We currently have the same issue and makes st.download_button unusable in production. Is there a workaround till the callback function is added? Also is there an ETA for the data callback being added?

goyodiaz · 2024-02-27T17:12:26Z

@sfc-gh-pkommini The only workaround I ever found is using two buttons as posted above.

BenGravell · 2024-03-10T12:50:24Z

+1 on this issue.

A super basic use-case is offering users a download of PNG images. This is a typical desire of a user if you want "archival quality" and are willing to eat the storage size - forcing people into JPEG all the time is not nice. PNG being mostly uncompressed means the filesize / data payload is going to be higher. Even moderately large PNG of dims 3072 x 4096 ends up being ~26 MB, which is totally feasible for generating in-memory and offering for one-off downloads. The ask is just to defer the costly serialization operations until the user actually clicks the download button, rather than having to do it every time just to display a download button. The workaround is too fiddly and requires too much ad-hoc state management to really be called a solution IMO.

iandesj · 2024-04-03T17:40:09Z

My team encountered this bug when apps are deployed in replicas to something like Kubernetes.

jayco10125 · 2024-05-12T23:16:06Z

2 buttons to do the job of 1 is not a suitable workaround. I end up have to have a save and export button when really I should just have an export button. Would be much appreciated if this was included, am very surprised it hasn't been already since being requested 2 years ago..

jrieke · 2024-05-14T21:44:12Z

Update

Hey all! Sorry for not getting back to this issue for a while. We built a prototype of this last year. However, the implementation was a bit hacky and would have required a bigger effort to get right, so we decided not to pursue it further for the moment.

Given that we recently released partial reruns via st.fragment, there's a good chance we want to rethink the implementation so the function that creates the data can run without rerunning the rest of the script. Other than eng time, there's not really a blocker for this project. Right now, though, we're still working on higher-priority projects. I'll update here once we start working on this again!

kocielgr · 2024-05-23T21:16:25Z

+1

gauthamkumaran · 2024-05-30T20:08:56Z

+1

QuestMi · 2024-05-31T01:48:40Z

Hi, I created a label to avoid reloading.

effect

    @staticmethod
    def export_to_xlsx(tab_name, data_df):
        io_bytes = io.BytesIO()
        writer = pd.ExcelWriter(io_bytes, engine='xlsxwriter')
        data_df.to_excel(writer, index=False, sheet_name=tab_name)
        worksheet = writer.sheets[tab_name]
        worksheet.set_column("A:AZ", 15)
        writer.book.close()
        b64 = base64.b64encode(io_bytes.getvalue()).decode()
        href = f'<a href="data:application/vnd.ms-excel;base64,' \
               f'{b64}" download="{tab_name}.xlsx" ' \
               f'class="download-btn" style="color: white"> ' \
               f'下载数据: {tab_name}</a>'
        st.markdown(f'<style>{DTN_CSS}</style>', unsafe_allow_html=True)
        st.markdown(href, unsafe_allow_html=True)

DTN_CSS = """.download-btn {
                   display: inline-flex;
                   align-items: center;
                   justify-content: center;
                   height: 25px;
                   padding: 0 4px;
                   font-size: 12px;
                   font-weight: bold;
                   line-height: 1.5;
                   text-align: center;
                   text-decoration: none;
                   white-space: nowrap;
                   vertical-align: middle;
                   cursor: pointer;
                   background-color: #FF4B4B;
                   border-radius: 6px;
                   transition: all .1s ease;}
           """

FJakovljevic · 2024-07-04T01:58:11Z

Here is one way how to have button that will execute download function only when clicked.
The added state is to avoid flicker since when button is clicked it will start a rerun of page from there and create empty div at top of the page (usually not what you want). This way it will create it as last thing on page and wont affect other elements, and there wont be flicker.

Using streamlit_javascript library but can be done also without it

import base64
import time

import streamlit as st
from streamlit_javascript import st_javascript


def download_text(text, filename):
    # long time process
    time.sleep(3)
    b64 = base64.b64encode(text.encode()).decode()
    js_function = f"""(function() {{
        var link = document.createElement('a');
        link.href = 'data:text/plain;base64,{b64}';
        link.download = '{filename}';
        link.click();
    }})();"""
    st_javascript(js_function)


# adding of state is needed if you want to avoide empty div with iframe at the top of your page (flickers the page)
def trigger_download():
    st.session_state["trigger_download"] = True

st.write("Execute download function only on click!")
st.button("Download Text", on_click=trigger_download)
if st.session_state.get("trigger_download", False):
    st.session_state["trigger_download"] = False
    download_text("This is the text content to download.", "example.txt")

Nurgak · 2024-08-29T00:15:18Z

It seems this is a recurrent feature request. I too am looking for deferred pre-processing before the download is initiated.

The main issue with the current approach is when you need to remotely download (or otherwise on-the-fly pre-process) large files. As of now these need to be loaded into memory, assuming caching is not a solution to get up-to-date data, which would make page loading slow and might not even be needed by the user. In my case that would be database backup exports: many large files, stored remotely, the user might or might not be interested in getting one of them...

@jrieke You mentioned the use of st.fragment, how would that work in this use-case? Do you have an example? Or does this need further development to work with st.download_button/st.button?

Ideally, I'd like st.download_button to accept a function for the data field. That function would need to return a generator or some stream, so large files could be downloaded/pre-processed on-the-fly and then immediately served as chunks to the user, preventing excessive memory usage. I have not thought this through though... this approach might have some issues for very large files or long processing time per chunk...

laurafiorini · 2024-09-03T22:17:34Z

+1

anayjain · 2024-09-13T11:56:38Z

+1

emanuelbarbera · 2024-09-16T11:25:48Z

+1

gsportelli · 2024-11-05T12:19:25Z

+1

DavideBFerri · 2024-11-14T11:52:03Z

+1

turin1989 · 2024-11-20T16:22:41Z

+1

Jokip7 · 2024-11-21T09:33:03Z

+1

Vinno97 added type:enhancement Requests for feature enhancements or new features status:needs-triage Has not been triaged by the Streamlit team labels Jul 28, 2022

lukasmasuch added feature:st.download_button and removed status:needs-triage Has not been triaged by the Streamlit team labels Jul 28, 2022

kajarenc mentioned this issue Nov 8, 2022

[WIP] Add two-step process for download data, should fix issue #5586 #5639

Closed

9 tasks

carolinefrasca added status:in-progress We're on it! added-voting-callout and removed added-voting-callout labels Nov 10, 2022

MathCatsAnd mentioned this issue Jan 8, 2023

st.download_button: Create downloadable data only when button is clicked. #5899

Closed

kmcgrady mentioned this issue Jan 18, 2024

Convert and download markdown text as a pdf report #7894

Closed

4 tasks

chrieke mentioned this issue Mar 15, 2024

Export image chrieke/prettymapp#12

Open

jrieke added status:likely Will probably implement but no timeline yet and removed status:in-progress We're on it! labels Apr 19, 2024

Defered data for download button #5053

Defered data for download button #5053

Comments

Vinno97 commented Jul 28, 2022 • edited by carolinefrasca Loading

Problem

Solution

lukasmasuch commented Jul 28, 2022

tomgallagher commented Jul 29, 2022

jrieke commented Jul 30, 2022

xR86 commented Aug 29, 2022

jrieke commented Sep 23, 2022

amirhessam88 commented Dec 26, 2022

wolfgang-koch commented Jan 9, 2023

jzluo commented Jan 19, 2023

HStep20 commented Jan 28, 2023

masonearles commented Mar 14, 2023

ElenaBossolini commented Aug 12, 2023

SabraHealthCare commented Oct 28, 2023

andrewpimm commented Oct 30, 2023 • edited Loading

jsulopzs commented Dec 13, 2023

CharlesFr commented Dec 17, 2023

ViniciusgCaetano commented Jan 6, 2024

zbjdonald commented Jan 11, 2024

LarsHill commented Jan 13, 2024

anki-code commented Feb 8, 2024 • edited Loading

sfc-gh-pkommini commented Feb 27, 2024 • edited Loading

goyodiaz commented Feb 27, 2024

BenGravell commented Mar 10, 2024

iandesj commented Apr 3, 2024

jayco10125 commented May 12, 2024

jrieke commented May 14, 2024 • edited Loading

Update

kocielgr commented May 23, 2024

gauthamkumaran commented May 30, 2024

QuestMi commented May 31, 2024

effect

FJakovljevic commented Jul 4, 2024

Nurgak commented Aug 29, 2024

laurafiorini commented Sep 3, 2024

anayjain commented Sep 13, 2024

emanuelbarbera commented Sep 16, 2024

gsportelli commented Nov 5, 2024

DavideBFerri commented Nov 14, 2024

turin1989 commented Nov 20, 2024

Jokip7 commented Nov 21, 2024

Vinno97 commented Jul 28, 2022 •

edited by carolinefrasca

Loading

andrewpimm commented Oct 30, 2023 •

edited

Loading

anki-code commented Feb 8, 2024 •

edited

Loading

sfc-gh-pkommini commented Feb 27, 2024 •

edited

Loading

jrieke commented May 14, 2024 •

edited

Loading