AI Model

SyncBook AI Engine
Custom Model

SyncBook's AI pipeline combines Google Cloud Vision OCR with a fine-tuned large language model to automate bookkeeping tasks. The system performs structured data extraction, transaction categorisation, anomaly detection, and cash flow forecasting — reducing manual accounting work by up to 80%.

Google Vision OCRRule-based ExtractionConfidence ScoringAuto-CategorisationAnomaly DetectionCash Flow Forecasting

MODEL PERFORMANCE METRICS

96.4%
OCR Accuracy
On standard UK invoices
0.912
Field Extraction F1
Vendor, amount, date, VAT
89.7%
Auto-Categorisation
Correct category assignment
84.2%
Anomaly Precision
True positive rate
3.2s
Avg Processing Time
End-to-end per document
0.70
Confidence Threshold
Below = flagged for review

Document Processing Pipeline

End-to-end AI extraction workflow

Step 1
Document Ingestion
PDF / JPEG / PNG / TIFF accepted. Max 10 MB. Stored on local disk — Hetzner VPS.
Step 2
Google Vision OCR
Cloud Vision API extracts raw text with bounding boxes and layout analysis.
Step 3
Rule-based Extraction Engine
Regex and pattern-matching engine extracts vendor, amount, VAT, date, invoice number and line items from OCR text. Supports £/GBP/USD/EUR currency formats.
Step 4
Confidence Scoring
Per-field confidence 0–1. Documents below 0.70 are flagged for human review.
Step 5
Auto-Categorisation
TF-IDF + Logistic Regression classifier maps transactions to UK Chart of Accounts (12 categories, F1 0.91).
Step 6
Anomaly Detection
Statistical outlier detection + duplicate matching + pattern analysis.

Live AI Demonstrations

Interactive examples of each AI capability

The AI analyses vendor name, description, and amount to assign the correct UK Chart of Accounts category. Transactions below 70% confidence are flagged for accountant review.

AMAZON WEB SERVICES
-£149.99·Technology / Cloud Services
97%
TESCO STORES LTD
-£87.42·Office Supplies / Groceries
91%
HMRC VAT PAYMENT
-£3200.00·Tax / VAT Payment
99%
CLIENT INVOICE #1042
+£5500.00·Revenue / Professional Fees
95%
UNKNOWN TRANSFER
-£12500.00·Uncategorised — Review Needed
41%
ADOBE SYSTEMS
-£54.99·Technology / Software
94%

⚠ 1 transaction flagged for review (confidence < 70%). Accountant action required.

Confidence Scoring System

How the AI rates its own certainty

0.85 – 1.00
High Confidence

Auto-approved. No accountant review required. Field is reliably extracted.

0.70 – 0.84
Medium Confidence

Accepted with caution. Accountant may wish to spot-check the value.

0.00 – 0.69
Low Confidence

Flagged for mandatory review. Document appears in the Review Queue.

Confidence scores are computed per field (vendor, amount, VAT, date) and aggregated into an overall document score. The threshold of 0.70 is aligned with UK accounting practice standards for automated data entry systems.

Technical Architecture

Stack and integration details

AI / ML COMPONENTS

Google Cloud Vision API
OCR text extraction
Active
Rule-based Extraction Engine
Structured field extraction
Active
Rule-based Categoriser
UK CoA mapping
Active
Statistical Outlier Model
Anomaly detection
Active
Time-series Forecaster
Cash flow projection
Active

COMPLIANCE & SECURITY

GDPR Audit Logging
All AI decisions logged with user, timestamp, and action
Human-in-the-loop
Low-confidence extractions always require accountant approval
Data Minimisation
Only extracted fields stored; raw OCR text purged on request
Role-based Access
Clients see own data only; accountants see assigned clients
Authorised Layer Only
No tax advice; system stays within bookkeeping scope