
Commit e6d50d2

add introduction and glossary to html docs
1 parent 6c485a2 commit e6d50d2

5 files changed: +202 -1 lines changed

intro_architecture.png (binary image, 22.4 KB)

scripts/docs_config/api.rst

Lines changed: 1 addition & 1 deletion
@@ -1,5 +1,5 @@
 ==========================================
-Unified Memory Framework API Documentation
+API Documentation
 ==========================================

 Globals

scripts/docs_config/glossary.rst

Lines changed: 100 additions & 0 deletions
@@ -0,0 +1,100 @@
Glossary
==========================================================

Homogeneous Memory
   A set of memory used in a system that is composed of a single type of
   memory technology, managed by a single driver using a uniform approach.

Heterogeneous Memory
   A set of memory used in a system that is composed of multiple types of
   memory technologies, each requiring distinct handling approaches, often
   managed by separate drivers.

Memory Tiering
   An organization of different types of memory storage within a system, each
   having distinct characteristics, performance, and cost attributes. These
   memory tiers are typically organized in a hierarchy, with faster, more
   expensive memory located closer to the processor and slower, less expensive
   memory located further away.

Memory Access Initiator
   A component in a computer system that initiates or requests access to the
   computer's memory subsystem. This could be a CPU, GPU, or other I/O and
   cache devices.

Memory Target
   Any part of the memory subsystem that can handle memory access requests.
   This could be the OS-accessible main memory (RAM), video memory that
   resides on graphics cards, memory caches, storage, external memory devices
   connected using the CXL.mem protocol, etc.

Memory Page
   A fixed-length contiguous block of virtual memory, described by a single
   entry in the page table. It is the smallest unit of data for memory
   management in a virtual memory operating system.

Enlightened Application
   An application that explicitly manages data allocation distribution among
   different types of memory and handles data migration between them.

Unenlightened Application
   An application that relies on the underlying infrastructure (OS,
   frameworks, libraries), which offers various memory tiering and migration
   solutions, without requiring any code modifications.

Memory Pool
   A memory management technique used in computer programming and software
   development, where relatively large, fixed-size blocks of memory are
   preallocated using one or more memory providers and then passed to a pool
   allocator for fine-grained management. The pool allocator can divide these
   blocks into smaller chunks and use them for application allocations,
   depending on its needs. Pool allocators typically focus on low
   fragmentation and constant allocation time, so they are used to optimize
   memory allocation and deallocation in scenarios where efficiency and
   performance are critical.

Pool Allocator
   A type of memory allocator used to efficiently manage memory pools.
   Existing examples include jemalloc and oneTBB's Scalable Memory Allocator.

Memory Provider
   A software component responsible for supplying memory or managing memory
   targets. A single memory provider kind can efficiently manage the memory
   operations for one or multiple devices within the system, or for other
   memory sources like file-backed or user-provided memory. Memory providers
   are responsible for coarse-grained allocations and the management of
   memory pages.

High Bandwidth Memory (HBM)
   A high-speed computer memory. It is used in conjunction with
   high-performance graphics accelerators, network devices, and
   high-performance data centers, and as on-package cache or on-package RAM
   in CPUs, FPGAs, supercomputers, etc.

Compute Express Link (`CXL`_)
   An open standard for high-speed, high-capacity central processing unit
   (CPU)-to-device and CPU-to-memory connections, designed for
   high-performance data center computers. CXL is built on the serial PCI
   Express (PCIe) physical and electrical interface and includes a PCIe-based
   block input/output protocol (CXL.io) and cache-coherent protocols for
   accessing system memory (CXL.cache) and device memory (CXL.mem).

oneAPI Threading Building Blocks (`oneTBB`_)
   A C++ template library developed by Intel for parallel programming on
   multi-core processors. oneTBB breaks a computation down into tasks that
   can run in parallel, and the library manages and schedules threads to
   execute these tasks.

jemalloc
   A general-purpose malloc implementation that emphasizes fragmentation
   avoidance and scalable concurrency support. It provides introspection,
   memory management, and tuning functionalities. `Jemalloc`_ uses separate
   pools (“arenas”) for each CPU, which avoids lock contention problems in
   multithreaded applications and makes them scale linearly with the number
   of threads.

Unified Shared Memory (USM)
   A programming model that provides a single memory address space shared
   between CPUs, GPUs, and possibly other accelerators. It simplifies memory
   management by transparently handling data migration between the CPU and
   the accelerator device as needed.

.. _CXL: https://www.computeexpresslink.org/
.. _oneTBB: https://oneapi-src.github.io/oneTBB/
.. _Jemalloc: https://jemalloc.net/

scripts/docs_config/index.rst

Lines changed: 2 additions & 0 deletions
@@ -7,4 +7,6 @@ Intel Unified Memory Framework documentation
 .. toctree::
    :maxdepth: 3

+   introduction.rst
    api.rst
+   glossary.rst

scripts/docs_config/introduction.rst

Lines changed: 99 additions & 0 deletions
@@ -0,0 +1,99 @@
==============
Introduction
==============

The amount of data that needs to be processed by modern workloads is
continuously growing. To address the increasing demand, the memory subsystem
of modern server platforms is becoming heterogeneous. For example,
High-Bandwidth Memory (HBM) addresses throughput needs, while the CXL protocol
closes the capacity gap and tends to improve memory utilization through memory
pooling capabilities. Beyond CPU use cases, there are also GPU accelerators
with their own on-board memory.

The opportunities provided by modern heterogeneous memory platforms come
together with additional challenges: additional software changes might be
required to fully leverage the new hardware capabilities. There are two main
problems that modern applications need to deal with. The first is appropriate
data placement and data migration between different types of memory. The
second is how software should leverage the different memory topologies.

All applications can be divided into two big groups: enlightened and
unenlightened. Enlightened applications explicitly manage data allocation
distribution among memory tiers and the subsequent data migration.
Unenlightened applications do not require any code modifications and instead
rely on the underlying infrastructure, which is in turn enlightened. The
underlying infrastructure refers not only to the OS with its various memory
tiering solutions for migrating memory pages between tiers, but also to
middleware: frameworks and libraries.

==============
Architecture
==============

The Unified Memory Framework (UMF) is a library for constructing allocators
and memory pools. It also contains broadly useful abstractions and utilities
for memory management. UMF allows users to manage multiple memory pools
characterized by different attributes, allowing certain allocation types to be
isolated from others and allocated using different hardware resources as
required.

A memory pool is a combination of a pool allocator and one or more memory
targets accessed by memory providers, along with their properties and
allocation policies. Specifically, a memory provider is responsible for
coarse-grained memory allocations, while the pool allocator controls the pool
and handles fine-grained memory allocations. UMF provides distinct interfaces
for both pool allocators and memory providers, allowing integration into
various applications.

.. figure:: ../assets/images/intro_architecture.png

The UMF library contains various pool allocators and memory providers, but it
also allows for the integration of external ones, giving users the flexibility
to either use existing solutions or provide their own implementations.
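
To give a rough sense of how these two interfaces are kept distinct, here is a
minimal, hypothetical sketch of the two operation tables. The struct and
member names are illustrative assumptions only and do not reproduce the actual
UMF headers:

.. code-block:: c

   #include <stddef.h>

   // Hypothetical, simplified shapes of the two UMF interfaces; the real
   // headers define richer operation tables (initialization, finalization,
   // queries, etc.).
   typedef int result_t; // stand-in for the library's result/error type

   typedef struct memory_provider_ops_t {
       // Coarse-grained, page-level operations on a memory target.
       result_t (*alloc)(void *provider, size_t size, size_t alignment,
                         void **out_ptr);
       result_t (*free)(void *provider, void *ptr, size_t size);
   } memory_provider_ops_t;

   typedef struct pool_allocator_ops_t {
       // Fine-grained, object-level operations served from coarse chunks
       // that the pool obtains through its memory provider(s).
       void *(*malloc)(void *pool, size_t size);
       void (*free)(void *pool, void *ptr);
   } pool_allocator_ops_t;

An external pool allocator or memory provider plugs into UMF by supplying such
an operation table, which is what lets existing solutions and custom
implementations be used interchangeably.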

Memory Providers
================

A memory provider is an abstraction for coarse-grained (memory page)
allocations and deallocations of target memory types, such as host CPU, GPU,
or CXL memory. A single memory provider kind can efficiently manage the memory
operations for one or multiple devices within the system, or for other memory
sources like file-backed or user-provided memory.

UMF comes with several bundled memory providers. Please refer to the README.md
for a full list of them. It is also possible to use externally defined memory
providers if they implement the UMF interface.

To instantiate a memory provider, the user must pass an additional context
that contains the details about the specific memory target that should be
used. This would be a NUMA node mask for the OS memory provider, a file path
for the file-backed memory provider, etc. After creation, the memory provider
context can't be changed.
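
As a minimal sketch, instantiating the OS memory provider could look like the
following. The ``umfOsMemoryProviderOps()``, ``umfOsMemoryProviderParamsDefault()``,
and ``umfMemoryProviderCreate()`` names follow published UMF releases and are
assumptions with respect to this revision of the library:

.. code-block:: c

   #include <umf/memory_provider.h>
   #include <umf/providers/provider_os_memory.h>

   int create_os_provider(umf_memory_provider_handle_t *out_provider) {
       // Context describing the memory target; it cannot be changed after
       // the provider has been created.
       umf_os_memory_provider_params_t params =
           umfOsMemoryProviderParamsDefault();
       params.numa_mode = UMF_NUMA_MODE_INTERLEAVE; // spread pages across nodes

       // Bind the OS memory provider implementation to this context.
       umf_result_t ret = umfMemoryProviderCreate(umfOsMemoryProviderOps(),
                                                  &params, out_provider);
       return ret == UMF_RESULT_SUCCESS ? 0 : -1;
   }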

Pool Allocators
===============

A pool allocator is an abstraction over object-level memory management based
on coarse chunks acquired from the memory provider. It manages the memory pool
and services fine-grained malloc/free requests.

Pool allocators can be implemented to be general purpose or to fulfill
specific use cases. Implementations of the pool allocator interface can
leverage existing allocators (e.g., jemalloc or oneTBB) or be fully
customizable. The pool allocator abstraction can contain basic memory
management interfaces, as well as more complex ones that can be used, for
example, by the implementation for page monitoring or control (e.g.,
`madvise`).

UMF comes with several bundled pool allocators. Please refer to the README.md
for a full list of them. It is also possible to use externally defined pool
allocators if they implement the UMF interface.
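
As a sketch of selecting a bundled pool allocator, the following pairs the
jemalloc-based pool allocator with a previously created memory provider. The
``umfJemallocPoolOps()`` entry point and the ``umfPoolCreate()`` signature
follow published UMF releases and are assumptions with respect to this
revision:

.. code-block:: c

   #include <umf/memory_pool.h>
   #include <umf/memory_provider.h>
   #include <umf/pools/pool_jemalloc.h>

   // 'provider' would come from umfMemoryProviderCreate(), as sketched in
   // the Memory Providers section above.
   int create_jemalloc_pool(umf_memory_provider_handle_t provider,
                            umf_memory_pool_handle_t *out_pool) {
       // Pair the bundled jemalloc pool allocator with the given provider;
       // NULL params and zero flags request the default pool behavior.
       umf_result_t ret =
           umfPoolCreate(umfJemallocPoolOps(), provider, NULL, 0, out_pool);
       return ret == UMF_RESULT_SUCCESS ? 0 : -1;
   }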

Memory Pools
============

A memory pool is a combination of a pool allocator and one or more memory
targets accessed by memory providers. In UMF, the user can either use one of
the predefined memory pools or construct a user-defined one using the Pool
Creation API.

After construction, memory pools are used by the Allocation API as the first
argument. It is also possible to retrieve a memory pool from an existing
memory pointer that points to memory previously allocated by UMF.
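
A minimal usage sketch, assuming a pool created as in the previous section and
the allocation entry points of published UMF releases (``umfPoolMalloc()``,
``umfPoolFree()``, ``umfPoolByPtr()``), which are assumptions with respect to
this revision:

.. code-block:: c

   #include <assert.h>
   #include <stddef.h>
   #include <umf/memory_pool.h>

   void use_pool(umf_memory_pool_handle_t pool) {
       // Every allocation names the memory pool it is served from.
       void *ptr = umfPoolMalloc(pool, 4096);
       if (ptr == NULL) {
           return; // allocation failed
       }

       // The owning pool can be recovered from a UMF-allocated pointer.
       umf_memory_pool_handle_t owner = umfPoolByPtr(ptr);
       assert(owner == pool);

       umfPoolFree(pool, ptr);
   }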
