Push notification architecture for company apps

Push notifications in internal apps serve a fundamentally different purpose than in consumer apps. There is no engagement optimization, no re-engagement campaign, no A/B testing click-through rates. Internal notifications are operational signals: a work order assignment, a safety alert, a schedule change, an approval request. Missed or delayed notifications have direct operational consequences. The architecture must prioritize reliability and deliverability over the marketing-oriented features that dominate third-party notification platforms.

Platform mechanics and their constraints

Apple Push Notification service (APNs) and Firebase Cloud Messaging (FCM) remain the only reliable paths to deliver push notifications on iOS and Android respectively. There is no practical way to bypass these platform services for background-delivered notifications on mobile. This is a hard dependency that must be acknowledged and engineered around, not resisted.

APNs requires token-based or certificate-based authentication from the sending server. Token-based authentication (using a signing key) is strongly preferred — the key does not expire annually like certificates, reducing operational overhead. FCM accepts server keys or OAuth2 credentials and handles the delivery to Android devices and, optionally, iOS devices through its cross-platform layer. Running FCM as the unified sender simplifies the server side but introduces a dependency on Google infrastructure for iOS delivery, which some organizations prefer to avoid.

Both services offer best-effort delivery with no guaranteed latency. Messages can be delayed by platform-level throttling, device power management (Doze mode on Android, background app refresh limits on iOS), or simple network conditions. For time-critical notifications — emergency safety alerts, for example — the app must implement a fallback: SMS via a gateway like Twilio, or an in-app polling mechanism that checks for pending alerts on a short interval when the app is foregrounded.

Server-side architecture

The notification pipeline should be decoupled from the business logic that triggers it. When a work order management system assigns a task, it publishes an event to a message broker (RabbitMQ, Kafka, or a managed service like Amazon SNS/SQS). A dedicated notification service consumes these events, resolves the target user’s device tokens, formats the payload, and dispatches to APNs/FCM.

This separation provides several advantages. The originating system does not need to know about device tokens, platform-specific payload formats, or delivery retry logic. The notification service can be scaled, monitored, and updated independently. Failed deliveries can be retried without re-triggering the business event.

Device token management is the most underestimated operational concern. Tokens change when a user reinstalls the app, restores a device, or migrates to new hardware. The notification service must handle token refresh events from the client and purge invalid tokens based on APNs/FCM feedback. A token registry that accumulates stale entries degrades delivery rates and wastes API quota.

Payload design should be minimal and actionable. The notification carries enough context for the user to decide whether to act — “Work Order #4412 assigned: Pump Station 7, Priority High” — without embedding sensitive data in the push payload, which may be visible on lock screens. Deep links should route directly to the relevant screen in the app, bypassing the home screen entirely.

Monitoring and observability

A notification that was sent is not a notification that was received. Delivery confirmation requires the client app to acknowledge receipt by calling back to the server when the notification is opened or when the app foregrounds and processes a silent notification. This closed-loop tracking reveals actual delivery rates, latency distributions, and failure patterns.

Alerting should trigger when delivery rates drop below a baseline — a sudden decline often indicates expired credentials, a misconfigured payload, or a platform-side incident. Dashboard visibility into queue depth, send rate, failure codes (invalid token, payload too large, rate limited), and acknowledgment rate gives the operations team the data needed to diagnose issues before users report them.

Takeaway

Internal push notification architecture should be built for reliability and operational transparency, not engagement metrics. Decoupling the notification pipeline from business systems, managing device tokens rigorously, and implementing delivery confirmation transforms notifications from a best-effort convenience into a dependable operational channel.