
Jiminny - How to Get Your Conversation Data

A practical guide to getting your conversation data out of Jiminny — covering REST API access, historical backfill, incremental polling, and how to route structured data into your downstream systems.

What you'll learn

  • What conversation data you can extract from Jiminny — transcripts, metadata, speaker labels, coaching data, and CRM context
  • How to access data via the Jiminny Customer API — API key authentication, endpoints, and pagination
  • Three extraction patterns: historical backfill, incremental polling, and scheduled batch
  • How to connect Jiminny data pipelines to Zapier, n8n, and Make
  • Advanced use cases — coaching A/B testing, talk track analysis, handoff scoring, and custom dashboards

Data

What Data You Can Extract From Jiminny

Jiminny captures more than just the recording. Every activity produces a set of structured assets that can be extracted via the Customer API — the transcript itself, speaker identification, timing metadata, coaching frameworks, and contextual CRM data associated with the call.

Common fields teams care about

Full transcript text with timestamps
Speaker labels (rep vs. prospect)
Call metadata (date, time, duration)
Associated CRM data and deal context
Coaching framework scores
Action items and follow-ups
Topics and key moments
Questions asked during the call
Recording links (expire after 24 hours)
Participant list and contact details

API Access

How to Get Transcripts via the Jiminny API

Jiminny exposes activities and transcripts through a REST API documented at jiminny.github.io/customer-api-docs (Swagger UI). The workflow is: authenticate with an API key, list activities by date range, then fetch the transcript for each activity.

1. Authenticate

Jiminny uses API Key authentication. Your Jiminny admin generates the key from the admin settings panel. Pass the key in the Authorization header on every request.

Authorization: Bearer <api_key>
Content-Type: application/json
API keys are generated by Jiminny admins. Contact your Jiminny admin to provision credentials if you don't have them. Full API documentation is available at jiminny.github.io/customer-api-docs.
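As a concrete sketch, the authentication step can be wrapped in a small helper. Note the base URL below is a placeholder assumption, not taken from this guide; confirm the real host in the Swagger docs at jiminny.github.io/customer-api-docs.

```python
import json
import urllib.request

# NOTE: placeholder host -- confirm the real base URL in the Swagger docs
# at jiminny.github.io/customer-api-docs before using this.
JIMINNY_API = "https://api.jiminny.com"

def auth_headers(api_key: str) -> dict:
    """Headers Jiminny expects on every Customer API request."""
    return {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }

def jiminny_get(path: str, api_key: str) -> dict:
    """Authenticated GET against the Jiminny API, returning parsed JSON."""
    req = urllib.request.Request(JIMINNY_API + path, headers=auth_headers(api_key))
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

Every subsequent example in this guide can route its requests through `jiminny_get`.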

2. List activities by date range

Call the activities endpoint with date range filters. Results are paginated — each response includes a cursor to fetch the next page. Jiminny recommends batches of 500–1000 activities for optimal performance.

GET /v1/activities?from=2025-01-01T00:00:00Z&to=2025-02-01T00:00:00Z&limit=500

Authorization: Bearer <api_key>

The response returns an array of activity objects with id, timestamp, duration, participants, and associated CRM data. Keep paginating until all results are returned.
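A minimal pagination sketch, assuming the response carries an `activities` array and a `next_cursor` field (these field names are assumptions; check the Swagger docs for the exact response shape). The page-fetching function is injected so the loop itself stays independent of the HTTP client:

```python
from typing import Callable, Iterator, Optional

def iter_activities(fetch_page: Callable[[Optional[str]], dict]) -> Iterator[dict]:
    """Yield every activity across all pages of GET /v1/activities.

    `fetch_page(cursor)` should call the endpoint with the given cursor
    (None for the first page) and return the parsed JSON response.
    """
    cursor = None
    while True:
        page = fetch_page(cursor)
        yield from page.get("activities", [])
        cursor = page.get("next_cursor")
        if not cursor:  # no cursor in the response means the last page
            return
```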

3. Fetch the transcript

For each activity ID, request the transcript from the activity detail endpoint. The response contains speaker-labelled utterances with timestamps, plus associated coaching data, action items, topics, and questions.

GET /v1/activities/<activity_id>/transcript

Authorization: Bearer <api_key>

Each utterance in the transcript includes speaker, timestamp, and text. Reassemble into plain text by concatenating utterances, or preserve the structured format for per-speaker analysis.
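Reassembly is a few lines, assuming the `speaker`, `timestamp`, and `text` keys described above (verify the exact field names against the API docs):

```python
def transcript_to_text(utterances: list) -> str:
    """Concatenate speaker-labelled utterances into a plain-text transcript."""
    lines = []
    for u in utterances:
        # One line per utterance, keeping timing and speaker attribution.
        lines.append(f"[{u['timestamp']}] {u['speaker']}: {u['text']}")
    return "\n".join(lines)
```

Keep the original structured utterances around as well if you plan to do per-speaker analysis later.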

4. Handle rate limits and recording expiry

Rate limits

Respect the API's rate limits and use the recommended batch sizes of 500–1000 activities. When you receive a rate limit response, back off and retry. For bulk operations, pace requests to avoid hitting ceilings, especially during backfills.

Recording link expiry

Recording links returned by the Jiminny API expire after 24 hours. If you need to retain access to audio or video files, download them within that window. Transcript text does not expire — only the media URLs are time-limited.
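A back-off-and-retry wrapper for the rate-limit case might look like the sketch below. Treating HTTP 429 as the rate-limit signal is an assumption (the guide does not specify a status code); adapt `is_rate_limited` to whatever error your client raises.

```python
import time

def with_backoff(call, max_retries: int = 5, base_delay: float = 1.0,
                 is_rate_limited=lambda exc: getattr(exc, "code", None) == 429,
                 sleep=time.sleep):
    """Run `call()`, retrying with exponential backoff on rate-limit errors."""
    for attempt in range(max_retries):
        try:
            return call()
        except Exception as exc:
            # Re-raise anything that isn't a rate limit, or the final failure.
            if not is_rate_limited(exc) or attempt == max_retries - 1:
                raise
            sleep(base_delay * 2 ** attempt)  # 1s, 2s, 4s, ...
```

During a backfill, wrap each transcript fetch (and any media download you need to complete within the 24-hour window) in `with_backoff`.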

Patterns

Key Extraction Flows

There are three practical patterns for getting transcripts out of Jiminny. The right choice depends on whether you're doing a one-off migration, running ongoing extraction, or need scheduled batch processing.

Backfill (Historical Export)

One-off migration of past calls

1. Define your date range — typically 6–12 months of historical activities, or all available data if migrating.
2. Call the activities endpoint with your date range filters. Use batch sizes of 500–1000 for optimal performance, and paginate through the full result set, collecting all activity IDs.
3. For each activity ID, fetch the transcript via the transcript endpoint. Pace requests to stay within rate limits.
4. Store each transcript with its activity metadata (activity ID, date, participants, CRM context, coaching data) in your data warehouse or object store.
5. Once the backfill completes, run your analysis pipeline against the stored data in bulk.

Tip: Persist your pagination cursor between batches. If the process is interrupted, you can resume from where you left off instead of re-scanning from the start. Remember that recording links expire after 24 hours, so download media files during the backfill if needed.
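The steps above, including the resumable cursor from the tip, can be sketched as one loop. Everything is injected as a callable (API wrappers, warehouse writer, cursor persistence) so the control flow is the only thing shown; the `activities` and `next_cursor` field names are assumptions to check against the Swagger docs.

```python
def backfill(fetch_page, fetch_transcript, store, load_cursor, save_cursor):
    """Resume-safe historical export.

    fetch_page(cursor)       -> parsed JSON from GET /v1/activities
    fetch_transcript(act_id) -> transcript for one activity
    store(act_id, transcript, activity) -> write to warehouse/object store
    load_cursor() / save_cursor(cursor) -> persist pagination state
                                           (a file, a DB row, etc.)
    """
    cursor = load_cursor()  # None on a fresh start; resumes otherwise
    while True:
        page = fetch_page(cursor)
        for activity in page.get("activities", []):
            store(activity["id"], fetch_transcript(activity["id"]), activity)
        cursor = page.get("next_cursor")
        save_cursor(cursor)  # persist after each page so a restart resumes here
        if not cursor:
            return
```

If the process dies mid-page, the page is refetched on restart; deduplicating on activity ID makes that harmless.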

Incremental Polling

Ongoing extraction on a schedule

1. Set a cron job or scheduled trigger (hourly, daily, etc.) that runs your extraction script.
2. On each run, call the activities endpoint with the from parameter set to your last successful poll timestamp.
3. Fetch transcripts for any new activity IDs returned. Use the activity ID as a deduplication key to avoid reprocessing.
4. Route each transcript and its metadata to your downstream pipeline — analysis tool, warehouse, or automation platform.
5. Update your stored cursor / timestamp to the current run time for the next poll cycle.

Tip: Account for transcript processing delay. A call that ended 10 minutes ago may not have a transcript yet. Polling with a 1–2 hour lag reduces empty fetches.
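One poll cycle, with the processing-delay lag and activity-ID deduplication baked in, can be sketched like this (the API wrappers are injected callables, not real client code):

```python
from datetime import datetime, timedelta, timezone

def poll_window(last_run: datetime, lag: timedelta = timedelta(hours=1)):
    """Compute the [from, to) window for one poll cycle.

    Polling only up to `now - lag` gives Jiminny time to finish
    transcript processing, reducing empty fetches.
    """
    end = datetime.now(timezone.utc) - lag
    return last_run, end

def poll_once(list_activities, fetch_transcript, seen, sink, window):
    """Process one poll cycle; returns how many new activities were handled."""
    start, end = window
    handled = 0
    for activity in list_activities(start, end):
        if activity["id"] in seen:  # activity ID is the deduplication key
            continue
        seen.add(activity["id"])
        sink(activity, fetch_transcript(activity["id"]))
        handled += 1
    return handled
```

In production, `seen` would be a database table or key-value store rather than an in-memory set, so deduplication survives restarts.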

Scheduled Batch Processing

Daily or weekly bulk extraction and analysis

1. Set up a scheduled job (daily end-of-day or weekly) that collects all activities from the previous period.
2. Pull activities in batches of 500–1000, per the Jiminny API's recommended batch sizes.
3. Fetch transcripts, coaching data, action items, and CRM context for each activity in the batch.
4. Route the complete dataset to your analysis pipeline — run Semarize kits in bulk, then write structured output to your warehouse or CRM.

Note: Batch processing works well with Jiminny because the API is optimised for bulk retrieval. Use batches of 500–1000 activities per request for the best balance of throughput and reliability.
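Splitting the collected activity IDs into API-friendly batches is a one-liner worth having as a helper:

```python
def chunked(items, size=500):
    """Yield successive batches of the recommended 500-1000 size."""
    for i in range(0, len(items), size):
        yield items[i:i + size]
```

Each yielded batch can then be fetched, enriched, and handed to the analysis pipeline as a unit.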

Automation

Send Jiminny Transcripts to Automation Tools

Once you can extract transcripts from Jiminny, the next step is routing them through Semarize for structured analysis and into your downstream systems. Below are end-to-end example flows — each showing the full pipeline from Jiminny API through Semarize evaluation to CRM, Slack, or database output.

Zapier — No-code automation

Jiminny → Zapier → Semarize → CRM

Poll Jiminny for new activities on a schedule, fetch the transcript, send it to Semarize for structured analysis, then write the scored output — signals, flags, and evidence — directly to your CRM.

Example Zap
1. Schedule by Zapier — polls for new Jiminny activities (Trigger: Every Hour, Timezone: UTC)
2. Webhooks by Zapier — list activities from the Jiminny API (GET /v1/activities?from={{last_run}}, Auth: Bearer <api_key>)
3. Webhooks by Zapier, for each activity — fetch transcript from Jiminny (GET /v1/activities/{{id}}/transcript, Auth: Bearer <api_key>)
4. Webhooks by Zapier — POST /v1/runs (sync) to Semarize (URL: https://api.semarize.com/v1/runs, Auth: Bearer smz_live_..., Body: { kit_code, mode: "sync", input: { transcript } })
5. Formatter by Zapier — extract brick values from the Semarize response: bricks.overall_score.value, bricks.risk_flag.value, bricks.pain_point.value
6. Salesforce - Update Record — write scored signals to the Opportunity (AI Score: {{overall_score}}, Risk Flag: {{risk_flag}}, Pain Point: {{pain_point}})

Setup steps

1. Create a new Zap. Choose "Schedule by Zapier" as the trigger and set it to run every hour (or your preferred interval).
2. Add a "Webhooks by Zapier" action (Custom Request) to list new activities from Jiminny. Set the method to GET, the URL to the activities endpoint with a from parameter based on the last run time, and add your API key as a Bearer token.
3. Add another "Webhooks by Zapier" action to fetch the transcript for each activity. Set the method to GET and pass the activity ID in the URL.
4. Add a third "Webhooks by Zapier" action. Set the method to POST and the URL to https://api.semarize.com/v1/runs. Add your Semarize API key as a Bearer token. In the body, set kit_code to your Kit, mode to "sync", and map the transcript text into input.transcript.
5. Add a Formatter step to extract individual brick values from the Semarize JSON response — overall_score, risk_flag, pain_point, etc.
6. Add a Salesforce (or HubSpot, Sheets, etc.) action to write the extracted scores and signals to your CRM record.
7. Test each step end-to-end, then turn on the Zap.

Watch out for: Zapier has step data size limits that can truncate very long transcripts. For calls over 60 minutes, consider storing the transcript in cloud storage and passing a reference URL instead of inline text. Use mode: "sync" so Semarize returns results inline — Zapier doesn't natively support polling loops.
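For reference, the request body the Custom Request step sends to Semarize looks like the fragment below, assembled from the flow above. Here "your_kit_code" is a placeholder and {{transcript_text}} stands for the Zapier field mapped from the transcript step; confirm the exact schema in the Semarize API docs.

```json
{
  "kit_code": "your_kit_code",
  "mode": "sync",
  "input": {
    "transcript": "{{transcript_text}}"
  }
}
```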
n8n — Self-hosted workflows

Jiminny → n8n → Semarize → Database

Poll Jiminny for new activities on a schedule, fetch transcripts, send each one to Semarize for analysis, then write the structured scores and signals to your database. n8n's native loop support handles pagination and batch processing.

Example Workflow
1. Cron - Every Hour — triggers the workflow on schedule (Mode: Every Hour, Timezone: UTC)
2. HTTP Request - List Activities — GET /v1/activities?from={{$now.minus(1, 'hour')}} (Jiminny), Auth: Bearer <api_key>
3. HTTP Request - Fetch Transcript, for each activity ID — GET /v1/activities/{{$json.id}}/transcript
4. Code - Reassemble Transcript — concatenate utterances into plain text (join utterances[].text by speaker)
5. HTTP Request - Semarize — POST /v1/runs (sync) (URL: https://api.semarize.com/v1/runs, Auth: Bearer smz_live_..., Body: { kit_code, mode: "sync", input: { transcript } })
6. Postgres - Insert Row — write structured output to the call_evaluations table (Columns: activity_id, score, risk_flag, pain_point)

Setup steps

1. Add a Cron node as the workflow trigger. Set the interval to your desired polling frequency (hourly works well for most teams).
2. Add an HTTP Request node to list new activities from Jiminny. Set the method to GET and the URL to the activities endpoint, configure Bearer auth with your API key, and set the from parameter to one interval ago.
3. Add a Split In Batches node to iterate over the returned activity IDs. Inside the loop, add an HTTP Request node to fetch each transcript via the transcript endpoint.
4. Add a Code node (JavaScript) to reassemble the utterances array into a single transcript string. Join each utterance's text, prefixed by speaker name.
5. Add another HTTP Request node to send the transcript to Semarize. Set the method to POST and the URL to https://api.semarize.com/v1/runs. Add your API key as a Bearer token. Set kit_code, set mode to "sync", and map the transcript into input.transcript.
6. Add a Code node to extract the brick values from the Semarize response — overall_score, risk_flag, pain_point, evidence, confidence.
7. Add a Postgres (or MySQL / HTTP Request) node to write the structured output. Use activity_id as the primary key for upserts.
8. Activate the workflow. Monitor the first few runs to verify Semarize responses are arriving and writing correctly.

Watch out for: Use activity IDs as deduplication keys to prevent reprocessing. You can also use async mode with n8n's native loop — POST /v1/runs (default async), then poll GET /v1/runs/:runId with a Wait + IF loop until status is "succeeded".
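The async-mode polling loop described in that note can be sketched as follows. The GET /v1/runs/:runId endpoint and the "succeeded" status come from the note above; treating "failed" as the other terminal status is an assumption to verify against the Semarize docs.

```python
import time

def wait_for_run(get_run, run_id, poll_interval=5.0, timeout=300.0,
                 sleep=time.sleep, clock=time.monotonic):
    """Poll a Semarize run until it reaches a terminal status.

    `get_run(run_id)` should wrap GET /v1/runs/:runId and return the
    parsed JSON response.
    """
    deadline = clock() + timeout
    while clock() < deadline:
        run = get_run(run_id)
        if run["status"] in ("succeeded", "failed"):
            return run
        sleep(poll_interval)
    raise TimeoutError(f"run {run_id} did not finish within {timeout}s")
```

This is the same Wait + IF loop n8n builds visually, expressed in code for pipelines that run outside an automation platform.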
Make — Visual automation with branching

Jiminny → Make → Semarize → CRM + Slack

Fetch new Jiminny transcripts on a schedule, send each to Semarize for structured analysis, then use a Router to branch the scored output — alert on risk flags via Slack and write all signals to your CRM.

Example Scenario
1. Schedule - Every 30 min — triggers the scenario on interval
2. HTTP - List New Activities — GET /v1/activities (Jiminny), Auth: Bearer <api_key>, Params: from={{formatDate(...)}}
3. HTTP - Fetch Transcript — Iterator: for each activity in the response, GET /v1/activities/{{item.id}}/transcript
4. HTTP - Semarize — POST /v1/runs (sync) (URL: https://api.semarize.com/v1/runs, Auth: Bearer smz_live_..., Body: { kit_code, mode: "sync", input: { transcript } })
5. Router - Branch on Risk Flag — Branch 1: IF risk_flag.value = true; Branch 2: ALL (fallthrough)
6. Branch 1 (risk detected): Slack - Alert Channel — notify the team in #deal-alerts ("Risk on {{activity_id}}, score: {{score}}")
7. Branch 2 (all calls): Salesforce - Update Record — write all scored signals to the Opportunity (AI Score: {{overall_score}}, Risk Flag: {{risk_flag}}, Pain Point: {{pain_point}})

Setup steps

1. Create a new Scenario. Add a Schedule module as the trigger, set to your desired interval (15–60 minutes is typical).
2. Add an HTTP module to list new activities from Jiminny. Set the method to GET and the URL to the activities endpoint, configure Bearer auth, and filter by the from parameter since the last run.
3. Add an Iterator module to loop through each activity. For each, add an HTTP module to fetch the transcript via the transcript endpoint.
4. Add another HTTP module to send the transcript to Semarize. Set the URL to https://api.semarize.com/v1/runs, add your Bearer token, and set kit_code, mode to "sync", and input.transcript from the previous step. Parse the response as JSON.
5. Add a Router module. Define Branch 1 with a filter: bricks.risk_flag.value equals true. Leave Branch 2 as a fallthrough (no filter).
6. On Branch 1, add a Slack module to alert your team when risk is detected. Map the score, risk flag, and activity ID into the message.
7. On Branch 2, add a Salesforce module to write all brick values (score, risk_flag, pain_point) to the Opportunity record.
8. Set the scenario schedule and activate. Monitor the first few runs in Make's execution log.

Watch out for: Each API call counts as an operation. A scenario processing 50 activities uses ~150 operations (list + transcript + Semarize per activity). Use mode: "sync" to avoid needing a polling loop for each run.

What you can build

What You Can Do With Jiminny Data in Semarize

Framework A/B testing, cross-rep talk track analysis, handoff quality scoring, and building your own revenue intelligence layer on structured conversation signals.

Coaching Framework A/B Testing

Data-Driven Methodology Selection

What Semarize generates

custom_framework_correlation = 0.72
meddicc_correlation = 0.53
segment_winner = "custom_midmarket"
deals_analysed = 400

Your team debates whether Jiminny's built-in MEDDICC coaching framework or your custom qualification framework better predicts closed-won deals. Instead of arguing, you pull 400 call transcripts from Jiminny's API and run both frameworks as separate Semarize kits, so each call gets scored twice. Correlating the scores with CRM outcomes in your warehouse shows your custom framework tracking wins 35% more strongly — but MEDDICC scores better on enterprise deals. You deploy both kits — custom for mid-market, MEDDICC for enterprise — and measure the impact quarterly with structured data, not opinions.

Coaching Framework A/B Test — 400 calls scored
  • Custom framework (mid-market): correlation 0.72, win-rate lift +18%
  • MEDDICC (enterprise): correlation 0.53, win-rate lift +11%
The custom framework correlates 35% more strongly with wins in mid-market.

Cross-Rep Talk Track Effectiveness

Data-Driven Enablement

What Semarize generates

talk_track_variant = "pain_first"
engagement_response = 0.78
objection_rate = 0.23
conversion_lift = 2.1x

Your 20-person sales team uses different talk tracks for the same product. Every call is recorded in Jiminny — but which talk track actually works? Pull all transcripts and run a talk track evaluation kit. Semarize identifies talk_track_variant (which pitch approach was used), prospect_engagement_response, objection_trigger_rate, and conversion_to_next_step. After scoring 600 calls, the data shows that the "pain-first" talk track has a 2.1x higher conversion than "feature-first" — but only for prospects with fewer than 500 employees. The team adopts segment-specific talk tracks backed by evidence.

Talk Track Comparison — 600 calls, segmented by company size
  • Pain-first (2.1x lift, best for <500 employees): 34% conversion, 0.78 engagement
  • Feature-first (best for all sizes): 16% conversion, 0.51 engagement
  • ROI-led (best for >500 employees): 28% conversion, 0.65 engagement
The pain-first talk track converts 2.1x better for prospects with fewer than 500 employees.

Deal Handoff Quality Scoring

Handoff Continuity

What Semarize generates

context_carried = true
qualification_gaps = 1
duplicate_discovery = false
time_to_close_impact = -45%

When deals move from SDR to AE, critical context often gets lost. Run the last SDR call and first AE call through a handoff continuity kit. Semarize checks context_carried_forward (did the AE reference pain points from the SDR call?), qualification_gaps_addressed, duplicate_discovery_avoided, and prospect_experience_score. After scoring 150 handoffs, the data reveals that deals where the AE references the SDR’s pain discovery in the first 5 minutes close 45% faster. SDR-to-AE handoff templates get restructured.

Handoff Continuity Score: 3/4 (SDR call → AE call)
  • Pain points referenced: carried
  • Budget qualification: carried
  • Timeline confirmed: missed
  • Decision maker identified: carried
Deals with pain referenced in the first 5 minutes close 45% faster.

Custom Objection Response Library Builder

Evidence-Backed Playbook Creation


What Semarize generates

objection_type = "budget_constraint"
response_effectiveness = 0.84
meeting_advanced = true
responses_catalogued = 340

A sales enablement manager vibe-codes a Supabase-backed app that runs every Jiminny transcript through an objection extraction kit. Semarize returns objection_type, rep_response_text, response_effectiveness_score, and whether the meeting advanced to next step. After 600 calls, the app has catalogued 340 real objection-response pairs with effectiveness scores. The team builds a data-backed objection playbook: the top-performing response to “budget constraint” converts 84% of the time vs. the current playbook response at 51%. Enablement replaces opinion-based playbooks with evidence-ranked responses.

Revenue Intelligence Dashboard — vibe-coded with Next.js
  • Acme Corp (low risk): conversation health 0.82, compliance 0.91
  • Globex Inc (medium risk): conversation health 0.71, compliance 0.68
  • Initech (high risk): conversation health 0.44, compliance 0.52
  • Competitive mentions this week: 3 (Competitor A ×2, Competitor B ×1)

Watch out for

Common Challenges & Gotchas

These are the issues that come up most often when teams start extracting transcripts from Jiminny at scale.

Recording links expire after 24 hours

Media URLs returned by the Jiminny API are temporary. If your pipeline needs access to the audio or video, download and store the files within 24 hours. Transcript text remains accessible — only the media links expire.

API key management

Jiminny uses admin-generated API keys for authentication. If a key is rotated or revoked, all dependent integrations break. Track which systems use which key, and set up monitoring for auth failures.

Batch size considerations

The API performs best with batch requests of 500–1000 activities at a time. Requesting too many in a single call can lead to timeouts, while too few increases the number of round trips needed for a backfill.

Transcript processing delay

Jiminny processes recordings asynchronously. Attempting to fetch a transcript too soon after a call ends will return empty or incomplete data. Build in a delay or retry mechanism.

Speaker label inconsistencies

Speaker identification isn't always perfect. Multiple participants, poor audio, or unregistered users can lead to misattributed utterances. Validate labels before using them for per-speaker analysis.

Pagination and cursor tracking

Activity listing endpoints return paginated results. Track your cursor position carefully — losing a cursor mid-backfill means re-scanning from the start or risking missed records.

Duplicate processing protection

Without idempotency checks, re-running an extraction flow can process the same call twice. Use activity IDs as deduplication keys to ensure each transcript is handled exactly once.
