From 7eb0f0c8395f85e7f98e757bd166635680e74052 Mon Sep 17 00:00:00 2001 From: Shagun Sodhani Date: Mon, 14 Dec 2020 06:52:30 -0500 Subject: [PATCH] Add two new papers --- README.md | 2 + ...t, Yield, and Scalable Tolerant Systems.md | 59 +++++++++++++++++++ ...28-A Foliated View of Transfer Learning.md | 56 ++++++++++++++++++ site/_site | 2 +- 4 files changed, 118 insertions(+), 1 deletion(-) create mode 100755 site/_posts/2020-09-21-Harvest, Yield, and Scalable Tolerant Systems.md create mode 100755 site/_posts/2020-09-28-A Foliated View of Transfer Learning.md diff --git a/README.md b/README.md index 578b9c61..45577da2 100755 --- a/README.md +++ b/README.md @@ -4,6 +4,8 @@ I am trying a new initiative - a-paper-a-week. This repository will hold all tho ## List of papers +* [A Foliated View of Transfer Learning](https://shagunsodhani.com/papers-I-read/A-Foliated-View-of-Transfer-Learning) +* [Harvest, Yield, and Scalable Tolerant Systems](https://shagunsodhani.com/papers-I-read/Harvest,-Yield,-and-Scalable-Tolerant-Systems) * [MONet - Unsupervised Scene Decomposition and Representation](https://shagunsodhani.com/papers-I-read/MONet-Unsupervised-Scene-Decomposition-and-Representation) * [Revisiting Fundamentals of Experience Replay](https://shagunsodhani.com/papers-I-read/Revisiting-Fundamentals-of-Experience-Replay) * [Deep Reinforcement Learning and the Deadly Triad](https://shagunsodhani.com/papers-I-read/Deep-Reinforcement-Learning-and-the-Deadly-Triad) diff --git a/site/_posts/2020-09-21-Harvest, Yield, and Scalable Tolerant Systems.md b/site/_posts/2020-09-21-Harvest, Yield, and Scalable Tolerant Systems.md new file mode 100755 index 00000000..57a0f9a7 --- /dev/null +++ b/site/_posts/2020-09-21-Harvest, Yield, and Scalable Tolerant Systems.md @@ -0,0 +1,59 @@ +--- +layout: post +title: Harvest, Yield, and Scalable Tolerant Systems +comments: True +excerpt: +tags: ['1999', 'Distributed Systems', 'Operating Systems', CAP, OS, Scale] + + +--- + +## Introduction + +* A classic paper that looks into strategies for scaling large systems that can tolerate graceful degradation. + +* [Link to the paper](https://dl.acm.org/doi/10.5555/822076.822436) + +## CAP Theorem + +* CAP refers to strong **C**onsistency, high **A**vailability, and **P**artitionability. + +* Strong consistency refers to single copy ACID consistency. + +* High availability means any consumer can access the data anytime. Generally, this is achieved by adding one or more data replicas. + +* Partitionability means that the system can survive a partition between the different replicas. + +* Strong CAP theorem states that any system can have only two out of three properties. + +* Weak CAP theorem says that stronger are the guarantees about any two properties, weaker are the third property's guarantees. + +## Harvest, Yield, and CAP Theorem + +* Assume that the clients are making a request to a server. + +* There are two quantities of interest here: + + * Yield - the probability of completing a request. + * Harvest - completeness of answer to a query. + +* In the presence of faults, a tradeoff can is made between yield and harvest. This tradeoff applies to both read and update queries. + +## Two strategies for scaling systems + + +### Trading Harvest for Yield + +* In a hundred node cluster (without replication), a single-node failure reduces harvest by 1 %, and in the case of multi-node failure, the harvest degrades linearly. + +* The probability of losing high-priority data can be reduced by replicating it. However, replicating all the data would not n guarantee 100% harvest and yield despite significant costs. + +### Application Decomposition and Orthogonal Mechanisms + +* Decompose a large application into subcomponents so that each component can be provisioned separately. Strong consistency can only be applied only on the components that need it, instead of the application as a whole. + +* Further, failure of one or more components need not cause the application to fail as a whole. + +* Decomposition also provides the opportunity to use orthogonal mechanisms, i.e., mechanisms independent of other mechanisms with no runtime interface. + +* Composition of orthogonal subsystems improves the robustness of runtime interactions by *locally* containing the errors. For example, the orthogonal components can be restarted /replaced independently without affecting other running components. \ No newline at end of file diff --git a/site/_posts/2020-09-28-A Foliated View of Transfer Learning.md b/site/_posts/2020-09-28-A Foliated View of Transfer Learning.md new file mode 100755 index 00000000..39e72d9f --- /dev/null +++ b/site/_posts/2020-09-28-A Foliated View of Transfer Learning.md @@ -0,0 +1,56 @@ +--- +layout: post +title: A Foliated View of Transfer Learning +comments: True +excerpt: +tags: ['2020', 'Transfer Learning', AI, Geometry, Theory] + + +--- + +## Introduction + +* The paper presents a formalism for transfer learning, offers a definition of relatedness between tasks, and proposes foliations as a mathematical framework to represent the relationship between tasks. + +* [Link to the paper](https://arxiv.org/abs/2008.00546) + + +## Summary + +* The term *representation* denotes a mechanism for *describing* and *realizing* abstract objects, thus allowing manipulation and reasoning about the objects. This description goes beyond the usual meaning (in deep learning), where *representation* denotes some useful information about data. + +* *Relatedness* describes *what* changes between tasks. Consider a set of transformations (or functions) that convert one task to another. A *relationship* between two tasks is an element of this transformation set. + +* Given a transformation set, one can define a *set of related tasks*, which is the set of all the tasks that can be transformed into each other using the functions from the given transformation set. This set of tasks is an equivalence class, and the transformation set is the equivalence relationship. + +* Given two related tasks *t1* and *t2*, denote the corresponding models (trained on those tasks) as *m1* and *m2*. One can assume that *m1* and *m2* are related in the same way as *t1* and *t2* (equivariance). + +* Now, given a set of transformations, one can partition the space of continuous functions into non-overlapping spaces, which describe a set of related tasks. These spaces are referred to as the *parallel spaces* or *transfer spaces*. + +* The parallel space represents a lower dimension than the original space. So knowing which parallel space a model lies on can make it easier to find it. This is the primary motivation behind transfer learning - knowing the relationship between tasks can make it easier to find a solution to new tasks. + +* Another way of partitioning the set of transformations is to use tessellation (e.g., Voronoi diagrams). Tasks in the same partition are similar to each other as compared to a task from another partition. + +* Two tasks are defined as *similar* if the distance between them (under some distance metric) is small. + +* Similarity is a *geometric* notion, while relatedness is a *transformative* notion. Parallelized space is to relatedness what tessellation is to similarity. + +* The distinction between similarity and relatedness is quite nuanced, and the authors provide several examples to differentiate between them. + +* Similarity can only be measured in terms of a reference element (similar to what). For example, when one finetunes a pre-trained model on a new task, one assumes that the model's pretraining task is similar to the current task. + +* Given a set (say *T*), a *quantity* (a function that maps elemenets of *T* to a *k* dimensional vector) is said to be *invariant* with respect to a transformation *p* (defined on *T*) if *q(f) = q(p(f))* ie the value of *f* (belonging to *T*) does not change if *f* is transformed by *p*. + +* If one assumes that the set of transformations is a group, specifically a Lie group whose action on the set of tasks is locally free and regular, then one can define a parallel partitioning of the space of tasks and the space of models. + +* One can develop a hierarchial categorization scheme for the set of all considered tasks using the invariant quantities. + +* One can consider the space of tasks and models to be smooth manifolds as manifolds naturally give a notion of representation and transformations between them. + +* A manifold is a topological space that can be locally mapped to a Euclidean space using coordinate charts. One can define regular foliation by choosing charts that satisfy certain conditions. In that case, the manifold has immersed, connected, non-intersecting submanifolds called leaves. + +* The charts (that satisfies those conditions) give a set of rectified coordinates, where the notions of "which leaf a point is on" and "where on the leaf it is" are clearly separated. + +* Thus, foliation can provide the theoretical tools to work with parallel spaces. + +* How can the foliations be incorporated into theory and solutions for transfer learning is left aa future work. \ No newline at end of file diff --git a/site/_site b/site/_site index 623ce9d6..c0e9649e 160000 --- a/site/_site +++ b/site/_site @@ -1 +1 @@ -Subproject commit 623ce9d663fa823837f7168bc35f61e145ee3baf +Subproject commit c0e9649eda6b04c9c4d68f40621d93bac32fa3a0