Reference Material

This section records information for easy reference.

Abbreviations and Acronyms

symboldescription
ACAnticipated Change
DOMDocument Object Model
GPUGraphics Processing Unit
MModule
MGModule Guide
OSOperating System
PDFPortable Document Format
RRequirement
SRSSoftware Requirements Specification
tinymorphthe text editor that helps you to become better writer
UCUnlikely Change

7.2 Data Dictionary

NameContentType
tinymorphmorph + ReverseProxy + asteraceaepackage
morphWeb interface for tinymorphpackage
ReverseProxyA middleware for reverse proxy with load balancermodule
asteraceaeSearch + ActivationCache + SAEs + logit_processor + Exopackage
logit_processorA function to convert incoming requests to logitsfunction
exoInference engine to run given LLMpackage
LLMopen-weights models to be run for inferencepackage
BlockManagerHelp manage KV-cache during inferenceclass
SAEsa set of sparse autoencoders trained against given LLM to steer activations generationpackage
SearchAdditional search tool to improve correctnessmodule
ActivationCacheStore said generations activations for performance purposesclass
streaming JSONOutputs from asteraceae will be streamed back to morphtext
tensor matrix represented inputs processed by logit_processortext

see also logits and sparse autoencoders

Link to original

Purpose

The following encapsulates both system and document purposes for tinymorph

System Purpose

The system is a web-based text editor designed to inspire creativity and make the writing process smoother and more enjoyable. It helps users explore their ideas in a non-linear way, offering interactive suggestions powered by advanced machine learning. With features like text steering, feedback, and note-style suggestions, the system gives writers the flexibility to refine their tone and structure while keeping full control over their work. Everything is stored locally on the user’s computer, so there’s no need for logins or worrying about data privacy. Whether you are working on creative writing or structured documents, the system provides tools to plan, brainstorm, and fine-tune ideas. It’s accessible through any modern web browser, ensuring a seamless experience without the hassle of installation or compatibility issues.

Document Purpose

The purpose of this document is to illustrate and justify the decomposition of the tinymorph system into its core components and provide a Modular Guide for each module. This document will serve as a guide for the implementation of the system, ensuring all components are designed to achieve the system’s goals effectively and maintain scalability, modularity, and user-friendliness. This document will also serve as a basis for the implementation work scheduled to be completed in the coming months.

Introduction

Decomposing a system into modules is a commonly accepted approach to developing software. A module is a work assignment for a programmer or programming team (Parnas, 1972). We advocate a decomposition based on the principle of information hiding (Parnas, 1972). This principle supports design for change, because the “secrets” that each module hides represent likely future changes. Design for change is valuable in SC,… where modifications are frequent, especially during initial development as the solution space is explored.

Our design follows the rules layed out by (Parnas et al., 1984), as follows:

  • System details that are likely to change independently should be the secrets of separate modules.
  • Each data structure is implemented in only one module.
  • Any other program that requires information stored in a module’s data structures must obtain it by calling access programs belonging to that module.

After completing the first stage of the design, the Software Requirements Specification (SRS), the Module Guide (MG) is developed (Parnas et al., 1984). The MG specifies the modular structure of the system and is intended to allow both designers and maintainers to easily identify the parts of the software. The potential readers of this document are as follows:

  • New project members: This document can be a guide for a new project member to easily understand the overall structure and quickly find the relevant modules they are searching for.
  • Maintainers: The hierarchical structure of the module guide improves the maintainers’ understanding when they need to make changes to the system. It is important for a maintainer to update the relevant sections of the document after changes have been made.
  • Designers: Once the module guide has been written, it can be used to check for consistency, feasibility, and flexibility. Designers can verify the system in various ways, such as consistency among modules, feasibility of the decomposition, and flexibility of the design.

The rest of the document is organized as follows. Section 3 lists the anticipated and unlikely changes of the software requirements. Section 4 summarizes the module decomposition that was constructed according to the likely changes. Section 5 specifies the connections between the software requirements and the modules. Section 6 gives a detailed description of the modules. Section 7 includes two traceability matrices. One checks the completeness of the design against the requirements provided in the SRS. The other shows the relation between anticipated changes and the modules. Section 8 describes the use relation between modules.

Anticipated and Unlikely Changes

This section lists possible changes to the system. According to the likeliness of the change, the possible changes are classified into two categories. Anticipated changes are listed in Section 2.1, and unlikely changes are listed in Section 2.2.

Anticipated Changes

Anticipated changes are the source of the information that is to be hidden inside the modules. Ideally, changing one of the anticipated changes will only require changing the one module that hides the associated decision. The approach adapted here is called design for change.

  • AC1: Integrating with newer language models, such as Llama 3.
  • AC2: Support for exploration of internal embeddings visualisation
  • AC3: More analytics to enable better understanding of users’ writing

Unlikely Changes

The module design should be as general as possible. However, a general system is more complex. Sometimes this complexity is not necessary. Fixing some design decisions at the system architecture stage can simplify the software design. If these decision should later need to be changed, then many parts of the design will potentially need to be modified. Hence, it is not intended that these decisions will be changed.

  • UC1: Vertical integration with tools like Notion, Obsidian, and text editors such as Neovim and VSCode.
  • UC2: Building a conversational UI (users can use ChatGPT or other LLMs-based chat UI for that.)
  • UC3: collect anonymous metrics (product will be provided as-is.)

Module Hierarchy

This section provides an overview of the module design. Modules are summarized in a hierarchy decomposed by secrets in Table 1. The modules listed below, which are leaves in the hierarchy tree, are the modules that will actually be implemented.

Level 1Level 2
Hardware-HidingGPU/Hardware Acceleration Module
Behaviour-HidingEditor Module
Notes Module
Graph View Module
Settings Module
Rendering Module
Data Fetching and State Management Module
Software DecisionInference Module
User Configuration Module
Analytics Module
Export and Integration Module

Table 1: Module Hierarchy

Hardware-Hiding Module(s)

  • M1:GPU/Hardware Acceleration Module

Behaviour-Hiding Module(s)

  • M2: Editor Module
  • M3: Notes Module
  • M4: Graph View Module
  • M5: Settings Module
  • M6: Rendering Module
  • M7: Data Fetching and State Management Module

Software Decision Module(s)

  • M8: Inference Module
  • M9: User Configuration Module
  • M10: Analytics Module
  • M11: Export and Integration Module

Connection Between Requirements and Design

The design of the system is intended to satisfy the requirements developed in the SRS. In this stage, the system is decomposed into modules. The connection between requirements and modules is listed in Table 2. This section documents the primary design decisions made to satisfy the requirements specified in the SRS. By explicitly recording these decisions, we ensure clear traceability between requirements and their implementation in the modular architecture.

Text Editor Core (Requirements FR-1 to FR-4)

  • Decision to implement a web-based editor using a modular architecture separating the UI components (Editor Module) from the underlying text processing logic
  • Implementation of real-time planning suggestions through dedicated Notes Module and Graph View Module
  • Storage of user preferences and configurations locally through the Settings Module
  • Integration with SAEs for tone and style customization via the Inference Module

Document Management (Requirements FR-5 to FR-8)

  • Use of local storage for file management to maintain the file-over-app philosophy
  • Implementation of version control through a graph-based data structure
  • Separation of rendering logic from document structure to support multiple export formats
  • Design of an extensible state management system for handling document metadata and user preferences

Performance and Scalability (Requirements PR-SLR1, PR-SLR2)

  • Integration of GPU acceleration for compute-intensive tasks like inference
  • Implementation of KV cache optimization in the Inference Module for improved TTFT
  • Design of asynchronous processing pipeline for suggestion generation
  • Use of efficient data structures for managing document state and history

Security and Privacy (Requirements SR-INT1 to SR-P1)

  • Decision to keep all user data local, avoiding cloud storage
  • Implementation of secure communication protocols for inference requests
  • Design of permission system for accessing local files
  • Integration of content filtering mechanisms in the Inference Module

User Experience (Requirements UH-EOU1 to UH-A3)

  • Implementation of keyboard navigation and accessibility features in the Editor Module
  • Design of intuitive interface components for suggestion interaction
  • Integration of theme customization through the Settings Module
  • Development of responsive layout system for cross-device compatibility

Module Decomposition

Modules are decomposed according to the principle of “information hiding” proposed by (Parnas et al., 1984). The Secrets field in a module decomposition is a brief statement of the design decision hidden by the module. The Services field specifies what the module will do without documenting how to do it. For each module, a suggestion for the implementing software is given under the Implemented By title. If the entry is OS, this means that the module is provided by the operating system or by standard programming language libraries. \progname{} means the module will be implemented by the \progname{} software. Only the leaf modules in the hierarchy have to be implemented. If a dash () is shown, this means that the module is not a leaf and will not have to be implemented.

Hardware-Hiding

GPU/Hardware Acceleration Module (M1)

  • Secrets: Algorithms for utilizing GPU resources to enhance performance, such as optimizing parallel computations for feature steering.
  • Services: Accelerates computationally intensive tasks, such as generating text or rendering visualizations, by using GPU capabilities.
  • Implemented By: GPU drivers, WebGL (if applicable).
  • Type of Module: Abstract Object.

Behaviour-Hiding

Editor Module (M2)

  • Secrets: State management of the editor, including virtual DOM updates, handling of formatting rules, and integration of context-sensitive suggestions.
  • Services: Serves as the maintain interface for text editing, providing features such as structured editing, real-time text previews, and contextual interactions. Supports seamless file parsing, note suggestions, and user-driven customizations.
  • Implemented By: Frontend libraries.
  • Type of Module: Abstract Data Type.

Notes Module (M3)

  • Secrets: Storage mechanisms, data structures for dynamic note updates, and algorithms linking notes to specific contexts within the document.
  • Services: Enables users to create, edit, delete, and manage notes in real time. Dynamically generates suggestions based on the document’s context, aiding the user’s creative process.
  • Implemented By: Frontend libraries, Machine learning models.
  • Type of Module: Library.

Graph View Module (M4)

  • Secrets: Algorithms for translating document structures into graph visualizations and efficiently synchronizing the graph with real-time document edits.
  • Services: Provides users with a graphical overview of the document’s structure to aid in navigation and organization. Updates dynamically to reflect changes made in the editor, ensuring consistency between the text and its visual representation.
  • Implemented By: Visualization libraries, State management.
  • Type of Module: Abstract Object.

Settings Module (M5)

  • Secrets: Mechanism for storing, retrieving, and applying user-specific preferences such as themes, font styles, and interface customizations.
  • Services: Allows users to configure the application’s appearance and behavior, maintaining these settings across sessions for a consistent experience. Adapts preferences dynamically based on user settings input.
  • Implemented By: LocalStorage APIs.
  • Type of Module: Abstract Data Type.

Rendering Module (M6)

  • Secrets: Parsing and rendering algorithms for converting user input, such as Markdown, into formatted text. Techniques for applying visual enhancements like annotations and syntax highlighting.
  • Services: Provides real-time previews of user content, enabling better visualization of the writing process. Enhances readability and interactivity through features such as dynamic formatting and highlights.
  • Implemented By: Markdown libraries, Frontend rendering engines.
  • Type of Module: Library.

Data Fetching and State Management Module (M7)

  • Secrets: Data synchronization logic for handling drafts, preferences, and metadata between the local storage and the application state.
  • Services: Ensures seamless data fetching, caching, and synchronization, preserving user progress and settings across sessions locally. Manages application state for a smooth, uninterrupted user experience.
  • Implemented By: LocalStorage.
  • Type of Module: Record.

Software Decision

Inference Module (M8)

  • Secrets: Integration techniques for advanced machine learning models, including parameter tuning and managing interactions between model outputs and user preferences.
  • Services: Generates personalized text suggestions by analyzing user input and context. Adapts dynamically to user-defined parameters like tone, style, and creativity.
  • Implemented By: Llama 3.
  • Type of Module: Library.

User Configuration Module (M9)

  • Secrets: Data structures and Algorithms for managing user-specific settings, including tone preferences, feature configurations, and real-time updates.
  • Services: Maintains user preferences and applies them consistently across all features of the application. Ensures personalized experiences by dynamically adapting configurations during sessions.
  • Implemented By: LocalStorage APIs.
  • Type of Module: Record.

Analytics Module (M10)

  • Secrets: Algorithms for tracking user progress metrics such as word count, tone consistency, and goal completion rates. Methods for generating insights and visual feedback.
  • Services: Tracks real-time user progress and provides actionable insights to achieve writing goals. Offers intuitive feedback to guide users in improving their writing process.
  • Implemented By: Frontend analytics tools.
  • Type of Module: Library.

Export and Integration Module (M11)

  • Secrets: Mechanism for formatting and exporting documents in multiple formats, such as Markdown and PDF, while preserving structure and integrity. Techniques for seamless integration with third-party tools.
  • Services: Converts user content into export-ready formats and integrates with platforms like Google Docs or Notion for enhanced workflow and collaboration.
  • Implemented By: REST APIs, Browser APIs.
  • Type of Module: Abstract Object.

Traceability Matrix

This section shows two traceability matrices: between the modules and the requirements and between the modules and the anticipated changes.

Req.Modules
R1M1, M2, M3, M4
R2M2, M3
R3M5
R4M6, M4
R5M6, M7, M4, M8, M9, M10
R6M6, M7, M4, M8, M9, M10
R7M6, M11, M4, M8, M10
R8M6, M11, M4, M8, M10
R9M12
R10M6, M7, M4
R11M6, M7, M11, M4
Table 2: Trace Between Requirements and Modules
ACModules
AC1M1
AC2M2
AC3M3
AC4M5
AC5M6
AC6M12
AC7M7
AC8M11
AC9M4
AC10M8
AC11M9
AC12M10
Table 3: Trace Between Anticipated Changes and Modules

Use Hierarchy Between Modules

In this section, the uses hierarchy between modules is provided. (Parnas et al., 1984) said of two programs A and B that A uses B if correct execution of B may be necessary for A to complete the task described in its specification. That is, A uses B if there exist situations in which the correct functioning of A depends upon the availability of a correct implementation of B. Figure 1 illustrates the use relation between the modules. It can be seen that the graph is a directed acyclic graph (DAG). Each level of the hierarchy offers a testable and usable subset of the system, and modules in the higher level of the hierarchy are essentially simpler because they use modules from the lower levels.

User Interfaces

Design of Communication Protocols

Timeline

See also Projects


Appendix

Parnas, D. L. (1972). On the criteria to be used in decomposing systems into modules. Commun. ACM, 15(12), 1053–1058. https://doi.org/10.1145/361598.361623
Parnas, D. L., Clements, P. C., & Weiss, D. M. (1984). The modular structure of complex systems. Proceedings of the 7th International Conference on Software Engineering, 408–417.

Revision

DateVersionNotes
Jan. 05 20250.0Created rough sketches for architecture
Jan. 10 20240.1Purpose
Jan. 12 20240.2Project Overview/Diagrams
Jan. 13 20240.3Updated Module Hierarchy and Table
Jan. 16 20240.4Finished Module Decomposition