MSPowerhouse — Your Strategic IT PartnerMSPowerhouse

Professional Services

Smartsheet to Azure Data Lake Gen2 Integration

The client wanted all Smartsheet data — not a single sheet export — in Azure Data Lake Gen2. MSPowerhouse used the Smartsheet REST API to discover sheets, loop through them, handle pagination, and land raw JSON by source, sheet, and run date.

CLIENT:

Confidential

ENGAGEMENT:

2024

SHARE

Smartsheet to Azure Data Lake Gen2 Integration

Overview

The client wanted to pull all available Smartsheet data into Azure Data Lake Gen2. A Smartsheet linked service was available, but it was not available in the expected Azure Data Factory copy tool experience, which created confusion around how to complete the integration.

Challenge

  • Needed full data pull across sheets, rows, columns, workspaces, and metadata.
  • Standard connector path did not provide the control needed for the requirement.
  • Required dynamic discovery, looping, and structured landing for future reporting.

Solution

MSPowerhouse designed a REST API-based Azure Data Factory pipeline for Smartsheet. Instead of depending only on the linked service experience, the solution used Smartsheet's API to discover available sheets and then loop through each sheet to extract detailed row and column data.

The pipeline was designed with a master-child pattern. The master pipeline handled discovery and orchestration, while child pipelines handled specific extraction tasks such as pulling sheet details, rows, metadata, or related objects. Data was landed in Azure Data Lake Gen2 in raw JSON format first, preserving the source structure for future transformation.

Technical Execution

  • Azure Data Factory REST API integration.
  • Smartsheet API authentication.
  • Sheet discovery process.
  • Looping through sheet IDs.
  • Pulling sheet details, rows, columns, and metadata.
  • Pagination handling.
  • Dynamic folder paths in ADLS Gen2.
  • Raw JSON landing by source, sheet, and run date.
  • Optional future expansion for attachments, discussions, and users.
  • A design that could support full loads and future incremental refreshes where supported.

Outcome

The client received a clear path for pulling Smartsheet data into Azure Data Lake Gen2 even though the built-in copy tool experience did not support the requirement cleanly.

Impact

This project showed MSPowerhouse's ability to work around connector limitations and still deliver a proper enterprise integration using APIs, ADF orchestration, and data lake storage.

Services Delivered

Azure Data FactorySmartsheet APIAzure Data Lake Gen2
Smartsheet to Azure Data Lake Gen2 Integration | MSPowerhouse