docs.sheetjs.com/29-pouchdb.md at 2e44e0c2c89c359440b103dfe28128466f402024

LankyMoose/docs.sheetjs.com

SheetJS 92e3c5aa72 mdx cleanup in preparation for v2

- use autolinks (e.g <https://sheetjs.com> -> https://sheetjs.com)
- move <summary> blocks to separate lines

2024-04-08 00:57:39 -04:00


---
title: Sheets in PouchDB
sidebar_label: PouchDB
pagination_prev: demos/cli/index
pagination_next: demos/local/index
sidebar_custom_props:
  type: nosql
---

import current from '/version.js';
import CodeBlock from '@theme/CodeBlock';

PouchDB is a pure JavaScript database with built-in synchronization features and offline support.

SheetJS is a JavaScript library for reading and writing data from spreadsheets.

This demo uses PouchDB and SheetJS to export database snapshots to spreadsheets and import bulk data from workbooks. We'll explore the subtleties of processing arrays of objects to mesh with both libraries.

The "Complete Example" section extends the official "Todos" demo so that the list can be exported to XLSX workbooks.

:::note Tested Deployments

This demo was tested in the following environments:

PouchDB	Date
`7.3.1`	2023-12-04
`8.0.1`	2023-12-04

:::

Integration Details

SheetJS CE offers standalone scripts, NodeJS modules, ESM modules, and other scripts. The "Installation" section covers a number of common deployment scenarios.

PouchDB ships with standalone scripts for browser use and NodeJS modules for use in server-side scripts¹.

The PouchDB constructor returns a Database object.

Importing Data

Database#bulkDocs² is the standard approach for bulk data import. The method accepts "arrays of objects" that can be generated through the SheetJS sheet_to_json³ method.

If rows do not include the _id parameter, the database will automatically assign an ID per row. It is strongly recommended to generate the _id directly.
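As a minimal sketch of generating the _id directly, the helper below assigns a zero-padded, index-based _id to each row before the rows are handed to the database. The name `add_padded_ids`, the `todo_` prefix and the six-digit width are assumptions for illustration and are not part of PouchDB or SheetJS. Padding is a design choice: when IDs are ordered as strings, `todo_10` would sort before `todo_2` without it.

```javascript
/* hypothetical helper: returns a new array where each row has a zero-padded _id */
function add_padded_ids(aoo, prefix, width = 6) {
  return aoo.map((row, idx) => ({
    ...row,
    /* e.g. prefix "todo_" and idx 7 yield "todo_000007" */
    _id: prefix + String(idx).padStart(width, "0")
  }));
}

/* sample rows shaped like the output of XLSX.utils.sheet_to_json */
const sample_rows = [
  { title: "Sheet", completed: true },
  { title: "js", completed: false }
];

const rows_with_ids = add_padded_ids(sample_rows, "todo_");
console.log(rows_with_ids.map(r => r._id)); // [ 'todo_000000', 'todo_000001' ]
```

The helper copies each row instead of mutating it, so the original array of objects can still be reused for other purposes.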

The following function starts from a SheetJS workbook object⁴ and uses data from the first sheet. The SheetJS read and readFile⁵ methods can generate workbook objects from files.

async function push_first_sheet_to_pouchdb(db, wb, _id_) {
  /* get first worksheet */
  const ws = wb.Sheets[wb.SheetNames[0]];

  /* generate array of objects */
  const aoo = XLSX.utils.sheet_to_json(ws);

  /* if a prefix is specified, add a unique _id to each row based on index */
  if(typeof _id_ == "string") aoo.forEach((row, idx) => row._id = _id_ + idx);

  /* perform query */
  return await db.bulkDocs(aoo);
}
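The bulk insert reports one result entry per submitted row, so it can be worth checking the entries individually. The sketch below assumes the result shape described in the PouchDB API documentation (successful entries carry `ok: true`, failed entries carry `error: true`). The helper name `summarize_bulk_results` and the sample data are hypothetical.

```javascript
/* hypothetical helper: split bulkDocs-style results into successes and failures */
function summarize_bulk_results(results) {
  const ok = results.filter(r => r.ok === true);
  const failed = results.filter(r => r.error);
  return { ok, failed };
}

/* sample data shaped like a bulkDocs response (assumed shape, not real output) */
const sample_results = [
  { ok: true, id: "todo_000000", rev: "1-abc" },
  { error: true, status: 409, name: "conflict", message: "Document update conflict" }
];

const summary = summarize_bulk_results(sample_results);
console.log(summary.ok.length, summary.failed.length); // 1 1
```

With a live database, the array resolved by `db.bulkDocs(aoo)` could be passed to the helper directly.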

:::note pass

Existing data can be erased with Database#destroy.

:::

Exporting Data

Database#allDocs⁶ is the standard approach for bulk data export. Generated row objects have additional _id and _rev keys that should be removed.

After removing the PouchDB internal fields, the SheetJS json_to_sheet⁷ method can generate a worksheet. Other utility functions⁸ can construct a workbook. The workbook can be exported with the SheetJS writeFile⁹ method:

function export_pouchdb_to_xlsx(db) {
  /* fetch all rows, including the underlying data */
  db.allDocs({include_docs: true}, function(err, doc) {
    /* stop if the query failed */
    if(err) return console.error(err);

    /* pull the individual data rows (each document is stored in the `doc` key) */
    const aoo = doc.rows.map(r => {
      /* `rest` will include every field from `r.doc` except for _id and _rev */
      const { _id, _rev, ...rest } = r.doc;
      return rest;
    });

    /* generate worksheet */
    const ws = XLSX.utils.json_to_sheet(aoo);

    /* generate workbook and export */
    const wb = XLSX.utils.book_new();
    XLSX.utils.book_append_sheet(wb, ws, "Sheet1");
    XLSX.writeFile(wb, "SheetJSPouch.xlsx");
  });
}
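The row-processing step can be separated from the database call so that it can be checked without a live database. The sketch below assumes the result shape described in the PouchDB API documentation when `include_docs: true` is set: each entry of `rows` stores the document under the `doc` key. The name `docs_to_aoo` is hypothetical, and skipping design documents (IDs starting with `_design/`) is an assumption about what belongs in an export.

```javascript
/* hypothetical helper: convert an allDocs-style result to an array of plain row objects */
function docs_to_aoo(result) {
  return result.rows
    /* skip entries without a document and design documents (assumed to be unwanted) */
    .filter(r => r.doc && !r.id.startsWith("_design/"))
    .map(r => {
      /* drop the PouchDB internal fields and keep everything else */
      const { _id, _rev, ...rest } = r.doc;
      return rest;
    });
}

/* sample data shaped like an allDocs response (assumed shape, not real output) */
const sample_result = {
  total_rows: 3,
  offset: 0,
  rows: [
    { id: "_design/x", key: "_design/x", value: { rev: "1-d" }, doc: { _id: "_design/x", _rev: "1-d", views: {} } },
    { id: "a", key: "a", value: { rev: "1-x" }, doc: { _id: "a", _rev: "1-x", title: "Sheet", completed: true } },
    { id: "b", key: "b", value: { rev: "1-y" }, doc: { _id: "b", _rev: "1-y", title: "js", completed: false } }
  ]
};

const export_rows = docs_to_aoo(sample_result);
console.log(export_rows.length); // 2
```

The returned array can be passed to `XLSX.utils.json_to_sheet` in place of the inline `map` call.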

:::caution pass

json_to_sheet expects an array of "flattened" objects where each value is a simple data type that can be stored in a spreadsheet cell. If document objects have a nested structure, integration code should post-process the data.

"Export Tutorial" processes data from an API and computes a few text values from the nested data.

:::
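One possible post-processing step for nested documents is to flatten each object into dotted keys before generating the worksheet. The sketch below is only one convention: the name `flatten_doc`, the `.` separator and the choice to join arrays into a comma-separated string are assumptions, not SheetJS or PouchDB behavior.

```javascript
/* hypothetical helper: flatten nested objects into a single level with dotted keys */
function flatten_doc(obj, prefix = "", out = {}) {
  for (const [key, value] of Object.entries(obj)) {
    const name = prefix ? prefix + "." + key : key;
    if (Array.isArray(value)) {
      /* assumption: arrays are stored as one comma-separated text value */
      out[name] = value.join(", ");
    } else if (value !== null && typeof value === "object" && !(value instanceof Date)) {
      /* recurse into plain nested objects */
      flatten_doc(value, name, out);
    } else {
      /* strings, numbers, booleans, dates and null are kept as-is */
      out[name] = value;
    }
  }
  return out;
}

const nested_doc = { title: "Sheet", meta: { owner: { name: "pouch" }, tags: ["a", "b"] } };
const flat_doc = flatten_doc(nested_doc);
console.log(Object.keys(flat_doc)); // [ 'title', 'meta.owner.name', 'meta.tags' ]
```

Applying the helper to every row (for example `aoo.map(row => flatten_doc(row))`) yields objects where each value fits in a single cell.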

Complete Example

Download the "Working Version" from the Getting Started guide.

The ZIP file should have MD5 checksum ac4da7cb0cade1be293ba222462f109c:

curl -LO https://github.com/nickcolley/getting-started-todo/archive/master.zip
md5sum master.zip || md5 master.zip
### the checksum will be printed

If the download is unavailable, a mirror is available at https://docs.sheetjs.com/pouchdb/master.zip

Unzip the master.zip file and enter the folder:

unzip master.zip
cd getting-started-todo-master

Edit index.html to reference the SheetJS library and add a button:

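    <!-- add a script tag for the SheetJS standalone script here (see the "Installation" section) -->
    <button id="xport">Export!</button>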

Near the end of index.html, look for a script tag referencing a CDN:

    <script src="//cdn.jsdelivr.net/pouchdb/3.2.0/pouchdb.min.js"></script>

Upgrade PouchDB by changing the src attribute to the production build¹⁰:

    <script src="//cdn.jsdelivr.net/npm/pouchdb@8.0.1/dist/pouchdb.min.js"></script>

Just before the end of js/app.js, add a click event listener:

  if (remoteCouch) {
    sync();
  }

  // highlight-start
  document.getElementById("xport").addEventListener("click", function() {
    db.allDocs({include_docs: true, descending: true}, function(err, doc) {
      const aoo = doc.rows.map(r => {
        const { _id, _rev, ... rest } = r.doc;
        return rest;
      });
      const ws = XLSX.utils.json_to_sheet(aoo);
      const wb = XLSX.utils.book_new(); XLSX.utils.book_append_sheet(wb, ws, "Sheet1");
      XLSX.writeFile(wb, "SheetJSPouch.xlsx");
    });
  });
  // highlight-end
})();

:::info pass

The demo UI reads the todo items in descending order:

    //------------------------------VVVVVVVVVVVVVVVV (descending order)
    db.allDocs({include_docs: true, descending: true}, function(err, doc) {
      redrawTodosUI(doc.rows);
    });

The new callback function also specifies descending: true to ensure that the order of todo items in the export matches the list displayed in the webpage.

:::

Start a local web server:

npx http-server .

The command will display a URL (typically http://localhost:8080) which can be opened in a web browser.

Testing

Access the URL displayed by the local web server with a web browser.
Add two items "Sheet" and "js". Mark "Sheet" as completed. The page should look like the following screenshot:

Click the "Export!" button at the top of the page. The site should create an export named "SheetJSPouch.xlsx".
Open the file in a spreadsheet editor. It should match the following table:

title	completed
Sheet	TRUE
js	FALSE
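For a quick check before opening a spreadsheet editor, the exported rows can be rendered as tab-separated text and compared with the table above. This is a sketch under stated assumptions: `aoo_to_tsv` is a hypothetical helper, the header is taken from the first row only, and booleans are printed as TRUE / FALSE to mimic how spreadsheet editors typically display them.

```javascript
/* hypothetical helper: render an array of objects as tab-separated text */
function aoo_to_tsv(aoo) {
  /* assumption: every row has the same keys as the first row */
  const header = Object.keys(aoo[0] || {});
  const fmt = v => (typeof v === "boolean" ? String(v).toUpperCase() : String(v ?? ""));
  const lines = aoo.map(row => header.map(k => fmt(row[k])).join("\t"));
  return [header.join("\t"), ...lines].join("\n");
}

/* rows matching the two todo items from the test steps */
const todo_rows = [
  { title: "Sheet", completed: true },
  { title: "js", completed: false }
];

const tsv = aoo_to_tsv(todo_rows);
console.log(tsv);
```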

1. See "Setting up PouchDB" in the PouchDB documentation.
2. See "Create/update a batch of documents" in the PouchDB API documentation.
3. See sheet_to_json in "Utilities".
4. See "SheetJS Data Model".
5. See read in "Reading Files".
6. See "Fetch a batch of documents" in the PouchDB API documentation.
7. See json_to_sheet in "Utilities".
8. See "Workbook Helpers" in "Utilities" for details on book_new and book_append_sheet.
9. See writeFile in "Writing Files".
10. The "Quick Start" section of "Download" in the PouchDB website describes the recommended CDN for PouchDB scripts.
