Sync API Docs

khuongduy354 · 23 May 2024 12:15

Sync API Docs

Main docs for Sync API users.

To use this API, assuming that sync target is available (user setup and ran synchronize on joplin client at least once)
Assuming that the library supports sync version 3 (latest sync version).
Repository: GitHub - khuongduy354/joplin-sync-lib
Demo: joplin-sync-lib/src/sample_app at master · khuongduy354/joplin-sync-lib · GitHub

Initialization

// 1. Pick a sync target 
// currently, database set to null as argument, in the future, we may inject a db instance
const syncTarget = new FileSystemSyncTarget(null);   

// 2. Init File API   
// depending on file api, it may be different, for e.g: filesystem file api need a path to a directory on the machine.
const syncPath = "src/sample_app/Storage/fsSyncTarget"; // filesystem sync target
await syncTarget.initFileApi(syncPath); 

// or, a MemorySyncTarget doesn't need to provide anything   
// const syncTarget = new MemorySyncTarget(null);
await syncTarget.initFileApi();


// 3. Retrieve synchronizer 
// with the synchronizer we can perform operations directly to sync target
const syncer = await syncTarget.synchronizer();  


// Extra: Init sync info method 
// as stated before, Sync API only works with initialized sync target (contains info.json),
// and sync target should be initialized using Joplin clients: Desktop, Mobile,... 
// this method below is used for testing only, which will initialize remote from code by creating a info.json file
await syncer.migrationHandler().initSyncInfo3();

Synchronizer Operations

After initializing Synchronizer from above steps, these methods are supported:

// GET a single item from remote, 
// Provide either an id or a path of item
// unserializeItem option is default to false, if true will return item as an object, return as string otherwise
.getItem(getItemInput): getItemOutput
type getItemInput = {
  path?: string;
  id?: string;
  unserializeItem?: boolean;
};
type getItemOutput = string | null | any;
 
// GET multiple items from remote 
// Similar to getItem, provide a list of ids (paths are not supported), unserializeAll === true will return items as an array of object, return an array of string otherwise
.getItems(getItemsInput): getItemsOutput  
type getItemsInput = {
  ids: string[];
  unserializeAll?: boolean;
};
type getItemsOutput = any[];


// GET items metadata from remote  
// context.timetamp return items after specified timestamp (inclusive), this is useful for delta, or detecting new items. 
// outputLimit is default to 50, will retrieve a maximum of X items, if there're more items needed retrieving, the output contains a hasMore flag and a timestamp to fetch more.
.getItemsMetadata(getItemsMetadataInput): getItemsMetadataOutput 
type getItemsMetadataInput = {
  context: {
    timestamp?: number; //in unixMs, retrieve items with .updated_time field after timestamp (inclusive)
  };
  outputLimit?: number; // default to 50 
};

type getItemsMetadataOutput = {
  items: any[];
  hasMore: boolean; 
  context: {
    timestamp: number; // use to fetch more, or keep track of next fetch for delta
  };
};

// UPDATE an item  
// To avoid conflict, this method only allow updating an item if its .updated_time field on remote is matched exactly with lastSync parameter. 
.updateItem(updateItemInput): updateItemOutput
type updateItemInput = {
  item: any; // preferably in Joplin BaseItem format
  lastSync: number; // timestamp in unixMs
};
type updateItemOutput = {
  // conflicted means the client timestamp is older than remote, which means another client has updated and this client hasn't pull the changes yet.
  // inaccurate timestamp means the client timestamp is newer than remote, which shouldn't be possible, because lastSync timestamp should be updated whenever both sides sync, the client can't independent sync, and has newer timestamp than remote. This is a result of wrongly tracked timestamp on client.  
  // succeeded means the lastSync arguments exactly equal item.updated_time on remote, and the item will be updated, it will return a newSyncTime, which client should keep track and use as lastSync argument for next update.
  status: "conflicted" | "inaccurate timestamp" | "succeeded";
  message: string;

  // return when conflicted, use this to resolve conflict
  remoteItem?: any;

  //  return when success
  newItem?: any;
  oldItem?: any;
  newSyncTime?: number; // updated timestamp
};

// CREATE multiple items on remote 
// The provided items should at least have .type_ field, if it's a resource (type_ == 4), then provide a path to the resource
// Items ids will be generated automatically during creation regardless of input contains id or not, this prevent the client to provide an already available id and cause conflict.
.createItems(createItemsInput): createItemsOutput
type createItemsInput = {
  items: any[]; //preferably array of Joplin BaseItem
};
type createItemsOutput = {
  createdItems: any[];
  failedItems: { item: any; error: any }[];
};

// DELETE multiple items 
// The provided items should at least have .type_ field and id, if it's a resource (type_ == 4), then this operation will find the blob and metadata, and delete both (2 delete API calls) for each item.
.deleteItems(deleteItemsInput): deleteItemsOutput
type deleteItemsInput = {
  deleteItems: any[];
};
type deleteItemsOutput = {
  status:
    | "succeeded"
    | "item not found" // item with provided id not available on remote
    | "could not delete item" // unknown error 
    | "read-only item can't be deleted"; 
  item?: any;
  error?: any;
};

// VERIFY sync info version and E2E settings on remote
// Run before every Sync operations, it will fetch remote sync info (info.json file) and make sure its sync version is 3 
// Then it will looks for remote E2E settings, and compare to the input E2E settings and prompts approriate actions client has to do to resolve conflicts (if happens), see E2E docs.
.verifySyncInfo(verifySyncInfoInput): verifySyncInfoOutput
type verifySyncInfoInput = {
  E2E: {
    ppk?: PublicPrivateKeyPair;
    e2ee: boolean;
  };
};

type verifySynInfoOutput = {
  status: "success" | "aborted";
  message: string;
  remoteSyncInfo?: any; // for debug
};

khuongduy354 · 11 July 2024 15:17

E2E integration flow for Sync API

Overview

This integration is aimed to track the current state of user E2E when sync, it will not fix or change E2E configurations automatically, user have to config manually if needed.

For this E2E flow, the cases are based on:

End-To-End Encryption (E2EE) | Joplin
Encryption | Joplin
To summarize there're 3 cases:

(1) Enable E2E (enable E2E on 1 device and populate E2E setup automatically through Synchronization) Device 1 turn on E2E, sync to sync target. Device 2 pick up E2E setup on next sync.
(2) Disable E2E (disable E2E on every device manually): both devices in synced with E2E on, device 1 then disable E2E and sync
(3) Changed E2E MasterKey: Assuming both devices have E2E enable properly and synced. Device 1 then change E2E key (change password) and sync.

In our Library, users cannot initialize an E2E setup (provide a password or toggle it on/off) because it requires proper re-encrypt and reupload data after an E2E is setup, but this library don't have much control over users' data.
Users can only initialize from Joplin Client (similar to sync initialization).
To determine which case and what actions to have, we'll start extracting from remoteSyncInfo, users should also provides last fetched E2EInfo (not provided means E2E disabled locally).

remoteE2EInfo = { 
activeMasterKeyId: string; 
ppk_: PublicPrivateKeyPair; 
e2ee: boolean; //indicate whether encryption is enable/disable  
}

localE2EInfo = { 
ppk: PublicPrivateKeyPair; 
e2ee: boolean; 
}

E2E setup flow

local.e2ee represents the state of our library E2E, if it's true all items are encrypted automatically before upload, other features related to E2E are also enabled.
Before every operations, the library will read remote and local (provided by user) sync infos, to check E2E states, and there're 4 cases below.

remote disabled, local disabled

E2E is disabled across devices, so allowing local.e2ee to false is fine

remote enabled, local disabled

Our client is the Device 2 in case (1) above, instruct user to enable local E2E, set local to remote ppk, set local masterKeyId to remote.activeMasterKeyId (will be used for encryption), prompts user to encrypt and reupload all items. Exactly like this

remote disabled, local enabled

Our client is Device 2 in case (2), instruct user to set local.e2ee to false manually, as image:

remote enabled, local enabled

-> we further check if the ppk of remote and client matched

if matched, which means 2 devices are synced properly with E2E
if unmatched, our client is Device 2 in case (3), instruct user to fetch the correct ppk, provides correct password for items encryption.

How E2E is applied if enabled

Synchronizer class hold the E2E setup infos (there's setter method for client to set), only 2 operations: CREATE and UPDATE can apply encryption, which uses the ItemUploader class, it will take the E2E input from Synchronizer and perform encryption accordingly.
For READ methods, client may need extra code to decrypt content with master key, because results return from Sync API will be encrypted string.

Topic		Replies	Views
Coding period Update - Week 6 Standalone Sync API	2	107	30 July 2024
Coding period Update - Week 1 Standalone Sync API	1	160	10 June 2024
Coding period update - Week 10 Standalone Sync API	0	67	18 August 2024
Coding period update - Week 4 Standalone Sync API	0	100	27 June 2024
Coding period update - Week 8-9 Standalone Sync API	0	68	1 August 2024

Sync API Docs

Sync API Docs

Initialization

Synchronizer Operations

E2E integration flow for Sync API

Overview

E2E setup flow

remote disabled, local disabled

remote enabled, local disabled

remote disabled, local enabled

remote enabled, local enabled

How E2E is applied if enabled

Related topics