SMS Clone
A specialized Android application that extracts SMS and broader device artifacts into structured forensic datasets for LLM training, evidence handling, and longitudinal change tracking.
Production
Android product build focused on acquisition, monitoring, packaging, and repeatable dataset generation
Product architect, Android engineer, and forensic workflow designer
SMS Clone
A specialized Android application that extracts SMS and broader device artifacts into structured forensic datasets for LLM training, evidence handling, and longitudinal change tracking.
Requirements Snapshot
- Problem solved: Teams preparing personal communication datasets or mobile evidence often need more than raw exports: they need clean threading, integrity verification, recoverability signals, and structured packaging.
- Business value: Turns fragmented mobile records into reusable JSON datasets and forensic artifacts, reducing manual preprocessing while improving traceability, auditability, and evidence quality.
- Target users: Developers, Researchers, Digital forensics analysts, OSINT teams, AI dataset builders
- Architecture style: Android forensic pipeline with acquisition services, shadow-mirror change tracking, structured export packaging, and background sync automation
Each section below maps to the structure requested in the specification: problem framing, architecture thinking, engineering challenges, impact, and roadmap.
Overview
SMS Clone is designed to bridge mobile communication data and AI dataset preparation by extracting conversations and related artifacts into structured, portable forensic outputs.
What It Solves
The product groups fragmented SMS records into chronological, contact-specific threads while preserving operational signals around changes, integrity, and packaging.
- Conversation-ready JSON output
- Deleted-message safety net via realtime change logging
- Cleaner training data for fine-tuning and analysis
Forensic Depth
Beyond SMS, the app captures supporting mobile evidence such as call logs, contacts, calendar data, browser history, app usage, and media EXIF metadata.
- SHA-256 hash generation for extracted artifacts
- High-quality HTML forensic reports
- Single ZIP evidence container for handoff
Technical Direction
The implementation is centered on Kotlin, WorkManager, Room shadow mirroring, and Android storage APIs so the export pipeline can run safely in the foreground or on scheduled daily syncs.
Tech Stack and Why
Screenshots / Gallery
Primary portfolio visual for this product.
- SMS monitoring notification
- Manual SMS export screen
- Forensic export report
Results / Impact
- Public GitHub repository
- Supports SMS plus call logs, contacts, calendar events, browser history, and app usage artifacts
- Includes root-aware SQLite recovery hooks for deleted record recovery
- Exports JSON, JSONL, HTML reports, and compressed evidence bundles
Need this level of thinking applied to your own product?
This case study format now creates a stronger path from portfolio browsing into hiring, consulting, or product delivery conversations.