-
Notifications
You must be signed in to change notification settings - Fork 97
What options exist for extending markdown cells to support reports on multiple kernels? [Was: Can customized kernel hijack markdown cells?] #41
Comments
No, not currently. We had the discussion a long time ago, and want something similar and generic across languages. The current conclusion is that we will not allow it before markdown/common mark have a syntax that allows extensions, otherwise you end up writing your own parser, what we don't have the bandwith to do. You can (of course) decide to monkey patch the notebook with the custom extension, we can't prevent you to do that. |
I went through the discussions and I have to agree that jupyter should really try to resolve this issue to make the reports more dynamic, and thus more useful. I disagree with your point that jupyter has to provide a language-neutral solution because variable and expressions from different languages take different forms and it is impossible to settle down with a common syntax. All jupyter needs to do is provide a
In the case of python, with the introduction of new format string to version 3.6, there is no reason to think of any other syntax. |
Hiding code cells is also a possibility, which would make it easier to do such things as the output of a hidden code cell. |
There is a hack to do this: If you use a PlaceProxy widget, you can replace any html id tag with a widget. So you could in your markdown cell do: ## samples
There are <span style="display:inline-block;" id="nsamples"></span> samples with names <span style="display:inline-block;" id="sample_names"></span> and then in your code cell, do from ipywidgets import PlaceProxy, Label
from IPython.display import display
nsamples = Label('5')
sample_names = Label("'name1'")
display(PlaceProxy(nsamples, selector="#nsamples"))
display(PlaceProxy(sample_names, selector="#sample_names")) Then if you set, for example, This is really fragile - it depends on the markdown being rendered to html on the page when the widget is displayed. It doesn't persist to the notebook file. But it is a kludgy way to do what you're asking. |
@BoPeng - markdown cells are never sent to the kernel in the current notebook. They are rendered completely in the browser. |
I understand it now, but this does not have to mean it cannot be changed. I share the opinion with many others (ipython/ipython#2592) that RStudio is miles ahead of Jupyter for generating pretty reports and this is one of the key features that are lacking. The issue was discussed back in 2012 and I do not understand why it was never taken seriously (apologize if I am wrong here). |
@takluyver Yes, that would more or less work, but it is not ready, right? (I can see jupyter/notebook#534). I do not know how that operates, but users can enter
as code, evaluate it, and hide the cell. It is not as convenient as entering
as markdown though. |
It's not that we didn't take it seriously, it's that it's not obvious how to make it work intuitively, and there are thousands of things we want to do. Apart from the syntax issue Matthias mentioned (we really don't want to go making more ad-hoc special symbols for Markdown), there's an interface question - do we expand the variables in a Markdown cell when we render it, or keep them up to date as code is executed? The former is technically much simpler, but Markdown cells, unlike code cells, have no visible code or prompt number to indicate that they might need to be re-executed. Should they disappear or stay when the kernel is restarted, the page is reloaded, or the notebook is converted to another format? A format like knitr has this much easier, because it has a clear source document and output document. If that's what you want, have a look at knitpy, or the cross-language tool dexy. Doing this interactively is more challenging) (I'm sure you can come up with answers for the questions above, but my point is that the answers aren't obvious - something has to be hashed out between different use cases and priorities) |
I read arguments from both sides but I am on the side that we are NOT creating another special symbol for markdown. Because each language has its own syntax and there is no way we can find a commonly accepted one, the decision on a syntax should be made at the kernel level. The kernels decide what special symbols/syntax to use and how to translate them to proper markdown syntax . I mean, Jupyter does not have to add or understand anything. It simply feed the markdown cell to the kernel (if
No. The markdown cells are evaluated with shift-enter and users can re-evaluated them if needed. The users could
The markdown cells would be hide-source cells with both versions saved (like code cells), but only the evaluated version is displayed. This is exactly what users are seeing now: the markdown code is hidden unless we enter the cell.
I failed to understand why it is so challenging if we accept a bit of inconvenience (e.g. manual re-evaluation of markdown cells) caused by the interactive nature of workflow. |
BTW, I understand @Carreau's argument that we are adding special syntax if we introduce special syntax to markdown cells of the python kernel, which can in theory cause existing documents fail to render. However, with the introduction of new hook (
and let users decide which kernel they want to use. Again, it does not have to be jupyter's job to decide on special syntax for any kernel. |
@BoPeng - what about doing this as a cell magic?
Cell magics are sent to the backend where the markdown can be rendered to html and sent back to the browser. Then the only piece missing is easily hiding the input and easily showing the input if you want to edit the source. |
Yes, this would certainly work. I just need to decide if I should add another way to specify report because users can already enter |
That untrue, we want to insert value in markdown, so it's enough to find a caracter which work in markdown.
That an opinion, not a truth, and no after long reflexion we think that doing that at kernel level is a bad idea for many reason:
That would require an enormous change in protocol, and break the main abstraction that kernels are only for computation, and there cannot be "if do_markdown" is defined, as this is a protocol you should conform to, and multiple client can be connected.
Again that's your opinion, some people do not like this solution,
The concept is interesting, and does make sens in some use case, and it is perfectly reasonable in some workflow. We did went through some of your ideas a few years ago, and after reflexions and prototypes we really things these are not the solution we like. Plus we have to maintain backward compatibility. So I think you underestimate the cost, advantages/drawbacks of such solutions, there are likely some pros and cons, and good ideas. Like I personally, of opinion that Shift-Entrer requerying variable to the kernel for a markdown cell is a good idea, with another key to render without changing the current value is one thing I like. But I recognize that for user it is highly confusing, and for non modal frontend, it would be a nightmare to come up with the right semantics. So we would be happy to see prototype, but it's likely not a direction we will take, at least not soon. Would be happy to proven wrong if you want to hack a prototype. |
I'm not sure what you mean here. What do ! lines have to do with an |
Sorry for the confusion. In SoS, reports are marked by lines with As a side question, there is no difference between line and cell magics in jupyter notebook, right? Because notebook would not evaluate a line starts with |
I think we have some serious misunderstanding on how dynamic markdown works so let me explain. When Jupyter sees
Your idea is to identify But this would not work if we are to allow arbitrary expressions because Jupyter does not know the syntax of the underlying language and does not know where to stop the pattern. For example, if a language happens to use
Jupyter would send an invalid expression |
The point is not to send arbitrary expression. You can't send arbitrary expression that have side effects, that would bread the assumption that rendering a text has no side effects. |
I've already mentioned that I don't want to make ad-hoc extensions to Markdown syntax, so the idea of encouraging kernels to each make their own ad-hoc extensions, and then implement mini parsers for them, is definitely not something I want. However, why not try doing this as a Javascript extension? You could hook the rendering of a Markdown cell and send it to your kernel to do what you want with. |
I do not get it. As soon as you send something to the kernel, there can be side effects. It does not matter what Jupyter think it is, variable or expression, the kernel can see it differently. For example, if you send So the resistance is on the dynamic markdown itself, not what syntax to use and what send to the kernel, and I do not understand why. It is true that rending a text should have no side effect, but we are rending dynamic text with side effect. |
I personally see nothing wrong for a kernel to make use of such features to generate better notebooks and I regret to see that your decision is based on your personal preference, not users' need. To me, not able to use ``r 1+1
I have close-to-zero knowledge on javascript... and I need to make sure that that is a proper way to implement such a kernel feature when I develop my own kernel. |
IIRC the protocol explicitly define Fair enough for bash it seem different but nothing prevent you from doing a bash kernel that check wether something has side effects.
No the resistance is on:
In the end, we spend time implementing and maintaining, while user need is relevant, it is weighted by usage, and the burden on our time, difficulty and maintenance, and other deadline we have. We are happy to revisit with more manpower as this is something we want to do. Right now despite the fact that we seem like a big project, we are a small team with many things to work on. If someone actually regularly involved in the project that contribute show an interest in pushing that forward, we'll likely revisit the decision, but currently no member of the core team have the time, and many of our discussions make us to believe that there is more work involved than it appears at first look.
Happy to help you getting acquainted with it. None of us were javascript developers when we started this. |
@BoPeng - you're wanting to generate reports, right? Can you use knitpy and write .pymd files? Is that equivalent to what rstudio offers? |
I am using multiple languages and would like to have an environment that I can work with R, python, shell etc, with ability to generate reports. The project we are working on is called script of scripts, which is a workflow system with a Jupyter kernel. The kernel can execute SoS steps, and can start and switch between sub-kernels (e.g. IRKernel) in the same notebook. You can have a preview of a jupyter notebook with SoS kernel at https://github.com/BoPeng/SOS/blob/master/examples/example.ipynb (no installation of sos is needed thanks to github). |
Are you wanting to have text that displays in the notebook showing the current values of variables interjected into text (e.g., a cell that you evaluate and you get text/values intermingled)? Or are you wanting to have a one-step conversion from a notebook to a separate report (which actually may be a notebook, or could be a separate file) that has values substituted in various places? Can you write a quick simple mockup of what you want the final result to be? Like what would it look like in the context of your example.ipynb? |
For SoS, reports from each step will be evaluated (with variables and expressions replaced by their values) and collected to form a final report file. There is no requirement on the format of this report but it is usually in markdown format and users can either process it outside of SoS, or add a step to their workflow to call, for example RMarkdown to generate reports in HTML or PDF formats. I am having a hard time to decide how reports should behave in Jupyter. Ideally, it could be dynamic markdown cells that can be evaluated by SoS. In this way the Jupyter notebook will display results of SoS steps as a report with the Now that Jupyter does not allow dynamic markdown, showing reports in markdown cells is not good enough so I will have to handle them in code cells (e.g. with a
As you can see, although Jupyter can still be used to preview the report, the notebook does not look like a report anymore and it is likely that users have to post-process reports generated by SoS, which is something I would really like to avoid. |
When I say that "I don't want" ad hoc extensions to Markdown, I'm using it as a shorthand for "I don't believe this is a good thing for users in the long term". It means the markdown in notebooks would be less generalisable, relying on a specially extended engine to render it correctly, and it's a source of potential future bugs as we have to maintain extra parsing machinery. You seem to be pushing hard for a solution that will work for your particular use case, without seriously working out how it fits into the notebook model as a whole. Your "bit of inconvenience caused by the interactive nature of workflow" is indicative. The notebook is an interactive tool, and it's not trying to be knitr. Doing something like this well requires thinking about interactivity, and that's a harder problem than batch processing. We are happy to help you do what you want as an extension, but I don't see this as something that should go into the protocol and the notebook application as it stands. |
As I have said, I gave up ipython notebook a few years ago because it lacks some features as a notebook tool. I am certainly not alone as many users in the 2012 thread shared the same concern. Although I have learned a lot about Jupyter in the past few days through the development of the SoS kernel, I am not familiar with Jupyer core and I am certainly not in a position to 'seriously working out how it fits into the notebook model'. However, from a user's point of view, I am working in an interactive environment with python or R, or some other language, if a code cell can be evaluated, if the markdown cell also need be evaluated (Ctrl-Enter), it is only nature for a markdown cell to integrate better with the shell and present what is happening there. This is my perception of the "notebook model", although it does not agree completely with the model of Jupyter. |
My concern is that there's no clear indication on a Markdown cell of when it was executed or even that it might need to be reexecuted at all. Also, I think markdown cells that have to be 'run' to be up to date could soon be implemented with a cell magic that produces Markdown output and hides the code cell. This won't require any changes to the protocol or the notebook (beyond the cell hiding that we're already planning). I think a neater approach for the notebook might be to use HTML tags like |
All, I've updated the title of this issue because the original title implies a security issue on quick scan of titles (i.e. "hijack"). I've updated the title of this issue to better reflect the discussion within the issue. |
There is no need for guessing with the
Does this work for all kernels? What I learned so far is that kernels have to implement their own magics.
There is a lot of work to enter |
I guess this represents the decision from the team. About 15 years ago when I was a graduate student with plenty of spare time, I contributed a lot to an open source project for similar reasons, but right now I do not have the bandwidth to contribute to Jupyter. Hopefully someone else with more time and skill can contribute such a feature. I think I will continue to use markdown cells for static contents such as headers for SoS notebooks, and use code cells for dynamic reports. It is not pretty (e.g. cell 4 and 5 of the updated sos notebook example but the coming hide-cell feature can hopefully be the remedy. Finally, I appreciate your hard work for making such a wonderful platform and your patience to explain to me the difficulties. I apologize if I sounded a bit harsh in my responses, but there was certain no hard feelings there. |
That's why one of my prototype was doing, but with a {{}} syntax in markdown to avoid the work to write
I never decided whether to update explicitly, or automatically, and technically you could have 2 syntax that each would have had a different behavior. like
No worry, it is always hard to convey the tone in writing, and we are really happy to see you creating the SOS kernel, with Jupyter getting traction we can hopefully have some people interested in tackling this problem and each conversation like these is alway a good source of information that help us to shape what it needed and hopefully get a solution at some point. |
At the spring dev meeting, there was a brief discussion about making calls for proposals (CFP) to request help from people with time and expertise in investigating and implementing big ticket features (e.g., internationalization). I get a sense this idea falls into the same basket. I don't think spending time writing up big, formal CFPs is the right approach at the moment. But is there something we can be doing to make these research topics / big ideas / help wanted more discoverable by people who are looking to dive in to something big in Jupyter? Off the cuff: apply a tag like "research project", aggregate that tag on a nice ideas.jupyter.org page by calling the github API. (Of course building the latter requires work too, so socializing a GitHub query to find all issues with the tag might be a simple place to start.) |
@parente Nice idea. I've gone ahead and tagged this issue with your suggested terms. |
Simple solution 1) Simple solution 2) In non-template cell,
Would get rendered literally.
In a Template cell it would get rendered as This is some text
|
A major concern here is that there is no consensus on the sigil to use for variable interpolation, and even if there is a consensus (e.g. use |
Oh, and I didn't really make clear how my idea addresses using dynamic formatting based on variables. Here is how it would work: Code
Non-Template Cell Output:# Something about a Template Cell Output:Something about aThe value of |
I see your point. This is indeed a brilliant method to avoid the sigil choice problem. Users are free to produce any text in any language-specific way and there is no need for any specific kernel support. If I understand correctly, all that is needed is that
It is not as easy as string interpolation though. I mean, instead of writing
users would have to write
to display
|
Right, exactly. Just as in a templating language for websites. If in a template cell, only stdout is printed, and it is rendered as markdown. Up to the user to make sure they send what they actually want to stdout. |
Given that it is unrealistic to have all kernels to support a new string interpolation feature, your proposal sound like a nice compromise. Are you going to implement it or start a new ticket to propose to the core developers? Rstudio has had inline string interpolation for many years, and Rstudio notebook already has nice multi-lanaguge support (something my SoS Jupyter kernel is working on), I believe Jupyter should move fast on features like this to catch up. |
Oh golly, I'm not nearly a capable enough programmer to tackle it. And I don't know any of Jupyter's innards. I think I need to lame out and start a feature request ticket. But if someone could line me out on part of the task, I could give it a try. And yes, Jupyter needs to keep up. |
Hi all,
The discussion about that kind of things started about 4 years ago (OMG, it
feels like I started contributed like yesterday), when things were still
IPython notebook, and had a prototype. (
ipython/ipython#2592)
While we would _really like_ to have such solution, the implication are
really touching everything, and so far I believe that we won't commit to
accepting something like that unless we are sure there is a long term
engagement.
We do miss though an essay on pro, cons, gotchas, advantage and state of
the art about what is currently existing.
Having something like that would already be a really good first step.
Thanks !
--
M
…On Mon, Mar 6, 2017 at 8:13 AM, Ariel Balter ***@***.***> wrote:
Oh golly, I'm not nearly a capable enough programmer to tackle it. And I
don't know any of Jupyter's innards. I think I need to lame out and start a
feature request ticket. But if someone could line me out on part of the
task, I could give it a try. And yes, Jupyter needs to keep up.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#41 (comment)>, or mute
the thread
<https://github.com/notifications/unsubscribe-auth/AAUez_qBN51vCfWs2sQPPKc31Qhtb2eGks5rjDCqgaJpZM4IeO6x>
.
|
Could you elaborate on what @abalter's proposal would touch? It is to my understanding that this proposal is the least intrusive one throughout the discussions because no kernel support is required. This kind of cells behaves on the frontend as markdown cells (execute to render MD, hide input, double-click to show code) and the only difference is that the cells are sent to the kernel and are evaluate as regular cells. Actually SoS has tried to implement some sort of active markdown (vatlab/sos#390), but failed because of my lack of knowledge on Jupyter frontend and JS skills. Perhaps this can be a new direction that we can try. |
Here is an idea: What if it was simply a type of magic:
|
Yes. Among other, it would change the notebook spec, which we are not against but is a lot of work. Moreover you need the kernel to produce the cell, which is problematic for things like nbconvert. You will of course need to have the kernels adopt your formatting, and be able to render whatever correctly. We already have issues with user typing non-standard markdown and using esoteric extensions, if kernel authors have the slightest mistake in their rendering code, then notebooks will become "wrong", and it's going to be a mess. Think also that we can't guaranty kernel-evaluation will not have side effects. There is not much difference either from use a Code Cell that ends with So while things like that seem "Obvious" and "simple" we had long discussions, prototypes, they have large consequences on maintenance. And while this is something we want, we'd like to do it right, and that's means not only thinking about it, but also following up on it, maintaining it, discussing with publisher what they would accept, what they need. |
In the end I implemented @abalter's idea in the SoS kernel with a This does not work at all like a Markdown cell. It is more troublesome to use but it is more powerful (e.g. support HTML, SVG, and other formats) and flexible. The bottom line is that it allows the generation of markdown content in any kernel using any method, which of course includes string interpolation. With the help of Jupyter extensions such as hide-code, users can hide the input of the cell so only the generated output is displayed. I could add an option |
I am working on a kernel where the underlying workflow system can generate reports based on results of other steps. For example,
would generate
I would like to store these report lines in jupyter as markdown cells but currently the kernel can only handle script cells through
do_execute
. Is there anyway that I can hijack markdown cell, change its content (expand${samples}
etc) before it is rendered?The text was updated successfully, but these errors were encountered: