--- sidebar_position: 7 title: HTML --- HTML is a common format for presenting data in the web. While the general read functions (`XLSX.read` and `XLSX.readFile`) can parse HTML strings and the write functions (`XLSX.write` and `XLSX.writeFile`) can generate HTML strings, the utility functions in this section can use DOM features. :::note SheetJS CE primarily focuses on data and number formatting. [SheetJS Pro](https://sheetjs.com/pro) supports CSS text and cell styles in the HTML format and HTML table utilities. ::: ## HTML Table Input ### Create New Sheet **Create a worksheet or workbook from a TABLE element** ```js var ws = XLSX.utils.table_to_sheet(elt, opts); var wb = XLSX.utils.table_to_book(elt, opts); ``` `XLSX.utils.table_to_sheet` takes a table DOM element and returns a worksheet resembling the input table. Numbers are parsed. All other data will be stored as strings. `XLSX.utils.table_to_book` produces a minimal workbook based on the worksheet. Both functions accept options arguments: | Option Name | Default | Description | | :---------- | :------: | :-------------------------------------------------- | |`raw` | | If true, every cell will hold raw strings | |`dateNF` | FMT 14 | Use specified date format in string output | |`cellDates` | false | Store dates as type `d` (default is `n`) | |`sheetRows` | 0 | If >0, read the first `sheetRows` rows of the table | |`display` | false | If true, hidden rows and cells will not be parsed | Exporting a table to a spreadsheet file in the web browser involves 3 steps: "find the table", "generate a workbook object", and "export to file". For example, if the HTML table has `id` attribute set to `sheetjs`: ```html
NameIndex
Barack Obama44
Donald Trump45
Joseph Biden46
``` `document.getElementById("sheetjs")` is a live reference to the table. ```js /* find the table element in the page */ var tbl = document.getElementById('sheetjs'); /* create a workbook */ var wb = XLSX.utils.table_to_book(tbl); /* export to file */ XLSX.writeFile(wb, "SheetJSTable.xlsx"); ```
Demo (click to hide) This HTML table has id set to `sheetjs`:
NameIndex
Barack Obama44
Donald Trump45
Joseph Biden46
```jsx live function SheetJSExportTable() { return ( ); } ```
### Add to Sheet **Add data from a TABLE element to an existing worksheet** ```js XLSX.utils.sheet_add_dom(ws, elt, opts); ``` `XLSX.utils.sheet_add_dom` takes a table DOM element and updates an existing worksheet object. It follows the same process as `table_to_sheet` and accepts an options argument: | Option Name | Default | Description | | :---------- | :------: | :-------------------------------------------------- | |`raw` | | If true, every cell will hold raw strings | |`dateNF` | FMT 14 | Use specified date format in string output | |`cellDates` | false | Store dates as type `d` (default is `n`) | |`sheetRows` | 0 | If >0, read the first `sheetRows` rows of the table | |`display` | false | If true, hidden rows and cells will not be parsed | `origin` is expected to be one of: | `origin` | Description | | :--------------- | :-------------------------------------------------------- | | (cell object) | Use specified cell (cell object) | | (string) | Use specified cell (A1-Style cell) | | (number >= 0) | Start from the first column at specified row (0-indexed) | | -1 | Append to bottom of worksheet starting on first column | | (default) | Start from cell `A1` | A common use case for `sheet_add_dom` involves adding multiple tables to a single worksheet, usually with a few blank rows in between each table: ```js /* get "table1" and create worksheet */ const table1 = document.getElementById('table1'); const ws = XLSX.utils.table_to_sheet(table1); /* get "table2" and append to the worksheet */ const table2 = document.getElementById('table2'); // highlight-next-line XLSX.utils.sheet_add_dom(ws, table2, {origin: -1}); ```
Multi-table Export Example (click to show) This demo creates a worksheet that should look like the screenshot below: ![Multi-Table Export in Excel](pathname:///files/multitable.png) The `create_gap_rows` helper function expands the worksheet range, adding blank rows between the data tables. ```jsx live function MultiTable() { const headers = ["Table 1", "Table2", "Table 3"]; /* Callback invoked when the button is clicked */ const xport = React.useCallback(async () => { /* This function creates gap rows */ function create_gap_rows(ws, nrows) { var ref = XLSX.utils.decode_range(ws["!ref"]); // get original range ref.e.r += nrows; // add to ending row ws["!ref"] = XLSX.utils.encode_range(ref); // reassign row } /* first table */ const ws = XLSX.utils.aoa_to_sheet([[headers[0]]]); XLSX.utils.sheet_add_dom(ws, document.getElementById('table1'), {origin: -1}); create_gap_rows(ws, 1); // one row gap after first table /* second table */ XLSX.utils.sheet_add_aoa(ws, [[headers[1]]], {origin: -1}); XLSX.utils.sheet_add_dom(ws, document.getElementById('table2'), {origin: -1}); create_gap_rows(ws, 2); // two rows gap after second table /* third table */ XLSX.utils.sheet_add_aoa(ws, [[headers[2]]], {origin: -1}); XLSX.utils.sheet_add_dom(ws, document.getElementById('table3'), {origin: -1}); /* create workbook and export */ const wb = XLSX.utils.book_new(); XLSX.utils.book_append_sheet(wb, ws, "Export"); XLSX.writeFile(wb, "SheetJSMultiTablexport.xlsx"); }); return ( <>

{headers[0]}
A2B2
A3B3
{headers[1]}
A6B6C6
A7B7C7

{headers[2]}
A11B11
A12B12
); } ```
### HTML Strings **Create a worksheet or workbook from HTML string** `table_to_book` / `table_to_sheet` / `sheet_add_dom` act on HTML DOM elements. Starting from an HTML string, there are two parsing approaches: A) Table Phantasm: create a DIV whose `innerHTML` is set to the HTML string, generate worksheet using the DOM element, then remove the DIV: ```js /* create element from the source */ var elt = document.createElement("div"); elt.innerHTML = html_source; document.body.appendChild(elt); /* generate worksheet */ var ws = XLSX.utils.table_to_sheet(elt.getElementsByTagName("TABLE")[0]); /* remove element */ document.body.removeChild(elt); ```
Phantasm Demo (click to show) The `html` variable in the demo is an editable HTML string ```jsx live function SheetJSTablePhantasm() { /* HTML stored as a string */ const html = `\
NameIndex
Barack Obama44
Donald Trump45
Joseph Biden46
`; return ( <>
HTML:
{html}
); } ```
B) Raw HTML: use `XLSX.read` to read the text in the same manner as CSV. ```js var wb = XLSX.read(html_source, { type: "string" }); var ws = wb.Sheets[wb.SheetNames[0]]; ```
Raw HTML Demo (click to show) The `html` variable in the demo is an editable HTML string ```jsx live function SheetJSRawHTMLToXLSX() { /* HTML stored as a string */ const html = `\
NameIndex
Barack Obama44
Donald Trump45
Joseph Biden46
`; return ( <>
HTML:
{html}
); } ```
### Value Override When the `raw: true` option is specified, the parser will generate text cells. When the option is not specified or when it is set to false, the parser will try to interpret the text of each TD element. To override the conversion for a specific cell, the following data attributes can be added to the individual TD elements: | Attribute | Description | |:----------|:-------------------------------------------------------------| | `data-t` | Override [Cell Type](/docs/csf/cell#data-types) | | `data-v` | Override Cell Value | | `data-z` | Override [Number Format](/docs/csf/features/nf) | For example: ```html 2012-12-03 2012-12-03 2012-12-03 2012-12-03 ```
HTML Value Examples (click to hide) ```jsx live function SheetJSHTMLValueOverride() { /* HTML stored as a string */ const html = `\
Celldata-tdata-vdata-z
2012-12-03
2012-12-03s
2012-12-03n41246
2012-12-03n41246yyyy-mm-dd
`; return ( <>
HTML String:
{html}
TABLE:
); } ```
### Synthetic DOM `table_to_book` / `table_to_sheet` / `sheet_add_dom` act on HTML DOM elements. Traditionally there is no DOM in server-side environments including NodeJS. :::note The simplest approach for server-side processing is to automate a headless web browser. ["Browser Automation"](/docs/demos/net/headless) covers some browsers. ::: Some ecosystems provide DOM-like frameworks that are compatible with SheetJS. Examples are included in the ["Synthetic DOM"](/docs/demos/net/dom) demo