From 6e59a7fb068c2e89fa1fa4ed7f29e63d5ced6018 Mon Sep 17 00:00:00 2001 From: Shagun Sodhani Date: Sun, 12 Feb 2023 14:09:40 -0500 Subject: [PATCH] Add toolformer paper --- ...mer-Language-Models-Can-Teach-Themselves-to-Use-Tools.html | 2 +- atom.xml | 4 ++-- 2 files changed, 3 insertions(+), 3 deletions(-) diff --git a/Toolformer-Language-Models-Can-Teach-Themselves-to-Use-Tools.html b/Toolformer-Language-Models-Can-Teach-Themselves-to-Use-Tools.html index d34cb91..ee172aa 100644 --- a/Toolformer-Language-Models-Can-Teach-Themselves-to-Use-Tools.html +++ b/Toolformer-Language-Models-Can-Teach-Themselves-to-Use-Tools.html @@ -200,7 +200,7 @@

Approach

Starting with a language model, M, the goal is to enable the language model to use tools by invoking API calls.

  • -

    An API call is denoted by the tuple $c = (api-name, api-input)$. It can be linearized as $e(c) = [api-name(api-input)]$ or as $e(c, r) = [api-name(api-input) -> r]$ where $r$ denotes the result of the API.

    +

    An API call is denoted by the tuple $c =$ (api_name, api_input). It can be linearized as $e(c) =$ [api_name(api_input)$]$ or as $e(c, r) = [$api_name(api_input) $ -> r]$ where $r$ denotes the result of the API.

  • The given dataset of plain text, $C$, is converted into a dataset $C*$ augmented with the API calls using a three-step process.

    diff --git a/atom.xml b/atom.xml index a68100d..26ef666 100644 --- a/atom.xml +++ b/atom.xml @@ -4,7 +4,7 @@ Papers I Read - 2023-02-12T14:01:15-05:00 + 2023-02-12T14:08:58-05:00 https://shagunsodhani.github.io/papers-I-read Shagun Sodhani @@ -35,7 +35,7 @@ <p>Starting with a language model, M, the goal is to enable the language model to use tools by invoking API calls.</p> </li> <li> - <p>An API call is denoted by the tuple $c = (api-name, api-input)$. It can be linearized as $e(c) = [api-name(api-input)]$ or as $e(c, r) = [api-name(api-input) -&gt; r]$ where $r$ denotes the result of the API.</p> + <p>An API call is denoted by the tuple $c =$ (api_name, api_input). It can be linearized as $e(c) =$ [api_name(api_input)$]$ or as $e(c, r) = [$api_name(api_input) $ -&gt; r]$ where $r$ denotes the result of the API.</p> </li> <li> <p>The given dataset of plain text, $C$, is converted into a dataset $C*$ augmented with the API calls using a three-step process.</p>