OpenAPI (Swagger docs) integration with serverless framework | Node.js Express.js Typescript


Introduction:

OpenAPI, formerly known as Swagger, is a specification: a standardised way to describe web APIs using a machine-readable format, typically YAML or JSON. OpenAPI docs can also be written inline as comments in JavaScript/TypeScript files. This format outlines the different endpoints, the data they expect (request body), the data they return (response), and other details that are crucial for developers and tools (e.g. Postman) to understand and interact with your API effectively.

The Serverless Framework lets you build and deploy applications without managing servers. You write code for specific functions that execute in response to events (like an API request). The cloud provider handles server provisioning, scaling, and maintenance, allowing you to focus on core functionality and benefit from pay-per-use pricing.

Setting up Swagger/OpenAPI with serverless:

Install the following dependencies:

npm i swagger-jsdoc lets you extract the Swagger/OpenAPI definition from comments in JavaScript files.

npm i swagger-ui-express serves the interactive Swagger UI.

[Image: sample Swagger UI]

Following is a utility function that sets up the Swagger configuration; it accepts the Express app as its input.

[Image: Swagger setup with the Serverless Framework]
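
The original setup is shown only as a screenshot. Below is a minimal sketch of an equivalent utility, assuming swagger-jsdoc and swagger-ui-express; the title, version, docs route, and file paths are illustrative rather than taken from the original.

const swaggerJsdoc = require('swagger-jsdoc');
const swaggerUi = require('swagger-ui-express');

// Builds the OpenAPI spec from code comments and serves the interactive UI on the given Express app
const setupSwagger = (app) => {
  const options = {
    definition: {
      openapi: '3.0.0',
      info: { title: 'My Serverless API', version: '1.0.0' }, // illustrative metadata
    },
    apis: ['./src/routes/*.js'], // files containing the OpenAPI comments
  };

  const swaggerSpec = swaggerJsdoc(options);
  app.use('/docs', swaggerUi.serve, swaggerUi.setup(swaggerSpec)); // illustrative route
};

module.exports = setupSwagger;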

swagger-jsdoc extracts the API definitions from the .js files listed in the apis array.

Swagger and serverless: the x-forwarded-prefix header

Swagger needs the original URL to generate the API documentation correctly, but the API Gateway (acting as a proxy) rewrites the request URL and records the stripped path prefix in the “x-forwarded-prefix” header. With this middleware, the original URL is restored and assigned to req.originalUrl.
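
The middleware itself appears as a screenshot in the original; a sketch of what it typically looks like is below, assuming an Express middleware registered before the Swagger routes.

// Restore the original URL that the API Gateway proxy rewrote, using the forwarded prefix
app.use((req, res, next) => {
  const prefix = req.headers['x-forwarded-prefix'];
  if (prefix) {
    req.originalUrl = prefix + req.url;
  }
  next();
});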

[Image: setting up Swagger (OpenAPI) with the Serverless Framework]

A sample swagger doc

[Image: sample Swagger doc annotation]
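
The sample itself is a screenshot; a sketch of what such an annotation commonly looks like with swagger-jsdoc is below. The path, summary, and handler are illustrative only.

/**
 * @openapi
 * /users:
 *   get:
 *     summary: List users
 *     responses:
 *       200:
 *         description: A JSON array of users
 */
app.get('/users', (req, res) => {
  res.json([]); // illustrative handler
});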

Result

[Image: final result of the Swagger doc integration with the Serverless Framework]

I had trouble setting up Swagger docs with AWS, and I hope this article helps you out.
Thank you for reading 🙂
Abd038

Socials:
https://www.linkedin.com/in/abdul-mohammad-a567b2183/
https://twitter.com/iam_abd038
https://github.com/iam-abdul

Navigating the Clouds: A Comprehensive Guide to Modern Cloud Infrastructures


Introduction:

As a full-stack developer, understanding cloud architecture is crucial in today’s digital age, where cloud computing serves as the backbone of the tech industry. This technology supports everything from small startups to global enterprises. Mastering cloud architecture goes beyond knowing the various components; it involves designing, deploying, and managing these systems efficiently to fully leverage their potential.

In this article, we will delve into the intricate world of cloud architectural design, exploring key patterns from traditional client-server models to advanced serverless frameworks. Whether you aim to optimize your applications for better performance or ensure high scalability and availability, the insights provided here will guide you through the essential architectures and best practices in modern cloud computing.

Cloud architecture is a conceptual model that encompasses the necessary components and subcomponents for cloud computing. These typically include a front-end platform, back-end platforms, a cloud-based delivery system, and a network designed to deliver computing services over the Internet.

Let’s explore the critical components and design patterns that form the foundation of effective cloud architecture.

Key Components of Cloud Architecture:

  • Front-End Platform (Client Side): This is what the end-user interacts with, typically involving web browsers or mobile applications.
  • Back-End Platform (Server Side): This includes servers, storage, and databases that manage the data and business logic of the application.
  • Cloud-Based Delivery Models: These models include infrastructure as a service (IaaS), platform as a service (PaaS), and software as a service (SaaS), each offering different levels of control, flexibility, and management.
  • Network: This includes the Internet or intranet, enabling communication between the front-end and back-end platforms.

The benefits of cloud architectures include the ability to easily scale resources to meet demand, reduce or eliminate capital expenditure on hardware and facilities, ensure services are always available, and safeguard data against local failures or disasters.

1. Client-Server Architecture:

The client-server architecture is a model where client applications request services from servers, which respond with the requested information or actions. This fundamental architecture underpins many web and network applications. It simplifies network management and centralizes data storage and processing but requires robust server management to avoid bottlenecks.

How It Works:

  • Request-Response Cycle: The client sends a request to the server for specific information or actions. The server processes this request and sends back a response. This cycle repeats as necessary to fulfill the client’s needs (a minimal sketch follows this list).
  • Centralized Management: Data and services are centralized on the server, making it easier to manage, update, and secure the resources. This centralization also simplifies data backups and disaster recovery.
  • Scalability Challenges: While centralization simplifies management, it can lead to scalability issues. As the number of clients increases, the server must handle more requests, which can create bottlenecks. Load balancing and server clustering are common solutions to address these challenges.
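
To make the cycle concrete, here is a minimal, illustrative Node.js sketch of one request-response exchange (Node 18+ for the built-in fetch); the port and payload are arbitrary and not tied to any particular cloud service.

const http = require('http');

// Server: responds to every request with a small JSON payload
const server = http.createServer((req, res) => {
  res.writeHead(200, { 'Content-Type': 'application/json' });
  res.end(JSON.stringify({ message: 'Hello from the server' }));
});

server.listen(3000, async () => {
  // Client: sends a request and logs the response, completing one request-response cycle
  const response = await fetch('http://localhost:3000');
  console.log(await response.json());
  server.close();
});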

Advantages:

  • Centralized Data Storage: All data is stored on the server, ensuring consistency and easier management.
  • Simplified Network Management: With centralized control, network management and security enforcement become more straightforward.
  • Ease of Maintenance: Updates and maintenance can be performed on the server without needing to modify the client-side applications.

Disadvantages:

  • Server Dependency: If the server fails, clients cannot access the requested services or data, leading to potential downtime.
  • Scalability Issues: High traffic can overwhelm the server, causing performance degradation unless proper load balancing and scaling strategies are implemented.
  • Network Latency: The performance of client-server applications can be affected by network latency, especially if clients are geographically dispersed.

Use Cases:

  • Basic Web Servers:

A common use case for client-server architecture is web hosting. Websites are hosted on servers, and web browsers (clients) request web pages, which the server then delivers.

  • Example: When you visit a website, your browser sends a request to the web server hosting the site. The server processes this request, retrieves the necessary web page, and sends it back to your browser to be displayed.

  • Email Services:

Email applications use a client-server model where email clients (e.g., Outlook, Gmail app) request emails from email servers.

  • Example: When you check your email, your email client sends a request to the email server. The server processes this request, retrieves your emails, and sends them back to the client for you to read.

  • Online Banking:

Online banking platforms use client-server architecture to allow users to manage their accounts and perform transactions securely.

  • Example: When you log into your online banking account, your client application sends a request to the bank’s server. The server verifies your credentials and provides access to your account information and services.

AWS Cloud Cost Optimization


Challenge: Reduce the AWS cloud operating cost without compromising any existing features.

Steps:

  1. Switch Dev and QA environments from 24×7 to 8×5 operation
  2. Use correctly sized servers in Dev and QA based on the actual load
  3. Avoid using the same server configuration as Prod in Dev and QA
  4. Create lifecycle rules and auto-shutdown schedules for services (SQL Analytics, SageMaker, S3, etc.); a sketch of one approach follows this list
  5. Minimise operational maintenance activities
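
As an illustration of step 4, auto-shutdown can be implemented with a scheduled Lambda that stops non-production instances outside working hours. The sketch below uses the AWS SDK for JavaScript v3 and a hypothetical Environment tag; it is one possible approach, not the exact setup behind this list.

const { EC2Client, DescribeInstancesCommand, StopInstancesCommand } = require('@aws-sdk/client-ec2');

const ec2 = new EC2Client({});

// Intended to run on a schedule (e.g. an EventBridge cron rule at the end of the workday)
exports.handler = async () => {
  const { Reservations = [] } = await ec2.send(new DescribeInstancesCommand({
    Filters: [
      { Name: 'tag:Environment', Values: ['dev', 'qa'] }, // hypothetical tag
      { Name: 'instance-state-name', Values: ['running'] },
    ],
  }));

  const instanceIds = Reservations.flatMap(r => (r.Instances || []).map(i => i.InstanceId));
  if (instanceIds.length > 0) {
    await ec2.send(new StopInstancesCommand({ InstanceIds: instanceIds }));
  }
  return { stopped: instanceIds };
};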

AWS Credentials for Serverless


AWS is a zero-trust platform [1]. That is, every call to AWS must provide credentials so that the caller can be validated and her authorization checked. How one manages these credentials will vary depending on the execution environment. A developer who gets his workstation set up with his own AWS credentials will often find that the application he is building cannot (and should not) consume credentials in the same way. Why do these differences exist? And what should he do to manage credentials?

For credentials on your workstation, AWS recommends using IAM Identity Center SSO. This lets you verify your identity (often with a standard, non-AWS identity provider like Google or Okta), and then that user can assume an IAM role to provide a set of temporary credentials. This works well and is fairly secure, especially if your identity provider is set up with multi-factor authentication (MFA). Because the AWS credentials are short-lived, even if they leak out, they expire quickly thus limiting exposure. Why can’t we take this approach with the applications we are building?

We want to have the application assume a role and pick up short-term credentials. However, we can’t use the workstation approach because we need user authentication (SSO/MFA) to establish the user, and that’s not possible at the application’s runtime. To get out of this jam, we can rely on the fact that all our application runtimes are serverless and will happen within an AWS service (in our case Lambda or Fargate). This establishes sufficient trust such that we can assign the execution environment a role and let it obtain short-term credentials.

In this article, I want to examine how your application running in either Lambda or Fargate ought to get its AWS credentials. We’ll discuss how the AWS SDKs use credential provider chains to establish precedence and one corner case I found myself in. Let’s dig in.

Credential Sources

As mentioned earlier, you can provide credentials to your application in several ways including (but not limited to) environment variables, config/credentials files, SSO through IAM Identity Center, instance metadata, and (don’t do this) directly from code. Irrespective of which method you choose, you can allow the AWS SDK to automatically grab credentials via its built-in “credentials provider.”

The mechanism for selecting the credential type is called the “credential provider” and the built-in precedence (i.e., the order in which it checks credential sources) is called the “credential provider chain.” This is language agnostic. Per AWS documentation, “All the AWS SDKs have a series of places (or sources) that they check to get valid credentials to use to make a request to an AWS service.” And, once they locate any credentials, the chain stops and the credentials are used.

For the NodeJS SDK, that precedence is generally:

  1. Explicit credentials in code (again, please don’t do this)
  2. Environment Variables
  3. Shared config/credentials file
  4. Task IAM Role for ECS/Fargate
  5. Instance Metadata Service (IMDS) for EC2

So, we can pass in credentials in many ways. Why should we choose one over another? Each approach varies in terms of security and ease of use. Fortunately, AWS allows us to easily set up our credentials without compromising security. We are going to focus on two recommended approaches: environment variables (for Lambda) and the task IAM role (for Fargate).
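
Because the chain is applied automatically, you normally do not configure it at all. The sketch below, using the AWS SDK for JavaScript v3, shows the default behaviour and its explicit equivalent via the credential-providers package; the region is illustrative.

import { DynamoDBClient } from '@aws-sdk/client-dynamodb';
import { fromNodeProviderChain } from '@aws-sdk/credential-providers';

// Default: the SDK walks the credential provider chain for you
const implicitClient = new DynamoDBClient({ region: 'us-east-1' });

// Explicit equivalent: the same chain, spelled out
const explicitClient = new DynamoDBClient({
  region: 'us-east-1',
  credentials: fromNodeProviderChain(),
});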

Environment Variable Credentials

Credentials in environment variables are, perhaps, the easiest way to configure your AWS SDK. They are near the top of the precedence list and will get scooped up automatically when you instantiate your SDK. For example, if you set the following environment variables in your runtime:

export AWS_ACCESS_KEY_ID="AKIA1234567890"
export AWS_SECRET_ACCESS_KEY="ABCDEFGH12345678"

Then when you instantiate your AWS SDK, these credentials will get loaded automatically, like so:

import { DynamoDBClient } from '@aws-sdk/client-dynamodb'
const client = new DynamoDBClient({ region: 'us-east-1' }); // Loads env credentials

Note that the AWS_ACCESS_KEY_ID begins with “AKIA”. This signifies that this is a long-term access key with no expiration. These types of keys are attached to an IAM user or, if you are reckless, the AWS account root user [2].

Alternatively, you may run across AWS credentials that look like the following:

AWS_ACCESS_KEY_ID=ASIA1234567890
AWS_SECRET_ACCESS_KEY=ABCDEFGH12345678
AWS_SESSION_TOKEN=XYZ+ReallyLongString==

These credentials are short-lived. You can tell this both by the presence of the AWS_SESSION_TOKEN and that the AWS_ACCESS_KEY_ID begins with “ASIA” instead of “AKIA”.

When you use a credential provider, it consumes the Access Key, Secret, and Session Token. These tokens can be set to expire anywhere from 15 minutes to 12 hours from issuance. This would be a drag if you had to repeatedly go fetch these short-lived tokens and save them so your application can use them. Fortunately, you don’t have to. Both Lambda and ECS offer built-in mechanics to provide your application with short-term credentials. Let’s start with Lambda.

Using Credentials in Lambda

Add the following line to one of your Lambdas:

console.log(process.env);

And you’ll see AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, and AWS_SESSION_TOKEN. How did they get there? AWS added them for you using the (required) IAM role attached to your Lambda. During initialization, AWS calls out to their Secure Token Service (STS), obtains short-term credentials, and then conveniently injects those credentials into your Lambda’s environment variables.

Lambdas are special in this regard. You (or the credentials provider) don’t have to do anything extra to fetch credentials, they are just there for you to use. Why?

Lambdas are short-lived. Even under constant use, every hour or so they are automatically recycled. This means that a single short-lived token can serve the Lambda that is using it; no re-fetching of tokens is necessary. For example, if AWS sets Lambdas to last no more than an hour before being decommissioned, it can set the expiration for the access token to just over 60 minutes and the application using the token will never need to fetch another.

Having your credentials provider automatically find and use the credentials in Lambda’s environment variables is both the recommended and easiest approach. This is a true win/win.

Using Credentials in Fargate

ECS Fargate shares many traits with Lambda: they’re both managed by AWS (in neither case are we taking care of the underlying servers), they scale up and down automatically, and each can have an IAM role that provides permissions for the application’s runtime.

However, Fargate containers don’t automatically recycle. They are relatively long-lived when compared to Lambda and can easily live longer than the maximum STS token expiration. This means the method used by Lambda to inject the STS tokens into the runtime environment won’t work.

Instead, you can use the (optional but recommended) Task Role ARN property of your ECS task definition to specify the permissions you would like your task to have. Your credentials provider can then assume this role to obtain short-term credentials it can use. It manages this for you; you don’t have to do anything except set the TaskRoleArn in your task definition.
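
For reference, a trimmed and illustrative task definition fragment with the task role set might look like the following; the ARN, names, and sizes are placeholders, and unrelated required fields are omitted.

{
  "family": "my-app",
  "taskRoleArn": "arn:aws:iam::123456789012:role/my-app-task-role",
  "requiresCompatibilities": ["FARGATE"],
  "cpu": "256",
  "memory": "512",
  "containerDefinitions": [
    { "name": "app", "image": "my-app:latest", "essential": true }
  ]
}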

Why You Should Know This

The AWS SDK’s credentials provider doesn’t know “I’m in a Lambda” or “I’m in Fargate.” When invoked, the SDK will use the default credentials provider to step through a chain of locations to look for credentials and it will stop as soon as it finds one. This means things often “just work.” But, it also means you can short-circuit the precedence chain if you are not careful (or you can do it purposefully; I’ll give an example later).

If you are using Lambda, and you new up an SDK client like this:

const client = new DynamoDBClient({
  region: 'us-east-1',
  credentials: {
    accessKeyId: 'ABC', // Don't do this
    secretAccessKey: '123', // Don't do this, either
  },
});

your credentials provider will never check the environment variables for credentials and will run with what you gave it.

Likewise, in Fargate, if you either pass in direct credentials or set environment variables of AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY, your credentials provider will never use your TaskRoleArn. This can be confusing if you are not used to it.

Breaking the Chain on Purpose

I was working with a client on a container migration project, where they needed to move their container workloads from Kubernetes (K8) on EC2 over to Fargate on ECS. At one point during the transition, the same container needed to be simultaneously running in both places. I knew I wanted to use the TaskRoleArn in Fargate, but that would not fly in the K8 deployment as it would grab the credentials from the EC2 instance on which it ran. And, since that EC2 instance served many disparate K8 pods, it was a poor [3] place to manage the runtime permissions of the containers underneath it.

The K8 configuration had environment variables set to long-term credentials for an IAM user. At first, the ECS task definition just used the same credentials (from env vars). Then, we created a dedicated IAM role for the task and attached it to the definition as a TaskRoleArn. OK, time for a quick quiz:

What happens now? The ECS container will:
A) Use the IAM role from TaskRoleArn.
B) Use the environment variable credentials.
C) Throw a ConflictingCredentialsError.

The correct answer is B. As long as those environment variable credentials are present, the credentials provider will stop looking after discovering them. During the migration, we used this to our advantage as we kept the code the same and just modified the configuration based on the destination (environment variable credentials in K8, none in Fargate). Eventually, we were only using the TaskRoleArn and we could retire those long-term credentials and the environment variables that surfaced them.

What Can Go Wrong?

Long-term credentials pose a real risk of leaking. AWS advises its users to take advantage of SSO and IAM roles for their user and application runtimes, respectively. I know an engineer who inadvertently checked a hard-coded AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY into a public GitHub repository. Within minutes, they had been scraped, and expensive Bitcoin miners were deployed in far-away regions of his company’s AWS account (kudos to AWS for expunging those charges from the AWS bill the following month).

The engineer had thought the repository was private. However, the fundamental issue was using hard-coded, long-lived credentials in the first place. Using fully managed, serverless architectures like Lambda and Fargate along with role-based, short-term credentials, you can avoid this kind of headache.

Further Reading

AWS Documentation: Setting credentials in Node.js
AWS CLI User Guide: Configuration and credential file settings
Amazon ECS Developer Guide: Task IAM role
Ownership Matters: Zero Trust Serverless on AWS
Gurjot Singh: AWS Access Keys – AKIA vs ASIA
Nick Jones: AWS Access Keys – A Reference

  [1] I have written on zero trust here.

  [2] Never give programmatic access to your root user.

  [3] Least privilege would be off the table.

Image Labeling with Amazon Rekognition


Amazon Rekognition facilitates object and scene detection in images, offering a secure, stateless API that returns a list of related labels along with confidence levels.

In this tutorial, you’ll build a serverless system to perform object detection upon image uploads to an Amazon S3 bucket. AWS Lambda will handle the processing logic, while Amazon DynamoDB will serve as the storage solution for the label detection results.

Object Detection Context and Limitations

Object and Scene Detection involves identifying objects and their context within an image. Object Detection locates specific instances of objects like humans, cars, buildings, etc. In Amazon Rekognition, Object Detection is akin to Image Labeling, extracting semantic labels from images. This approach also lends itself to scene detection, identifying both individual objects and overall scene attributes, such as “person” or “beach”.

In AWS, Object Detection with Amazon Rekognition involves utilizing its DetectLabels API. You can input the image as either a binary string or an S3 Object reference. Submitting images as S3 Objects offers advantages such as avoiding redundant uploads and supporting images up to 15MB in size, compared to the 5MB limit for images passed as binary data.

The response typically follows a JSON structure similar to the following:

{
    "Labels": [
        {
            "Confidence": 97,
            "Name": "Person"
        },
        {
            "Confidence": 96,
            "Name": "Animal"
        },
        ...
    ]
}

The API provides an ordered list of labels, ranked by confidence level, starting from the highest.

The quantity of labels returned is determined by two parameters:

  1. MaxLabels: This sets the maximum number of labels returned by the API.
  2. MinConfidence: Labels with a confidence score below this threshold will not be included in the response.

It’s crucial to note that setting low values for MaxLabels alongside high values for MinConfidence could result in empty responses.

Throughout the tutorial, we’ll use an S3 bucket to store images, leveraging the API to extract labels from each newly uploaded image. We’ll store each image-label pair in a DynamoDB table.

Let’s start

1) Create a DynamoDB table named “images”

[Image: DynamoDB table configuration]

2) Create an S3 bucket
Choose a unique name for the bucket; I will use the name “demo21-2”.
Make sure to select “ACLs enabled”.
Create a folder in the bucket and name it “images”.

3) Create a Lambda function as follows:
Choose “Use a blueprint” when creating the function.
For the blueprint name, choose “Use Rekognition to detect faces”

[Image: Lambda blueprint selection]

You will need to give the lambda function permissions to access the following services:

  • “CloudWatch” to put logs
  • “DynamoDB” to put, update, describe item
  • “Rekognition” to detect labels and faces

So you will create a role with the following IAM policies and attach it to the Lambda function, which assumes the role at runtime.

Basic execution role

{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Action": [
                "logs:CreateLogGroup",
                "logs:CreateLogStream",
                "logs:PutLogEvents"
            ],
            "Resource": "*"
        }
    ]
}

Lambda policy

{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Action": [
                "dynamodb:PutItem",
                "dynamodb:UpdateItem",
                "dynamodb:DescribeStream",
                "dynamodb:GetRecords",
                "dynamodb:GetShardIterator",
                "dynamodb:ListStreams"
            ],
            "Resource": "arn:aws:dynamodb:us-west-2:909737842772:table/images",
            "Effect": "Allow"
        },
        {
            "Action": [
                "rekognition:DetectLabels",
                "rekognition:DetectFaces"
            ],
            "Resource": "*",
            "Effect": "Allow"
        },
        {
            "Action": [
                "s3:*"
            ],
            "Resource": "*",
            "Effect": "Allow"
        }
    ]
}

In the S3 trigger section, enter the following, accepting the defaults for values not specified:

[Image: S3 trigger configuration]

You are now ready to test that the function is successfully triggered and calls Amazon Rekognition.
Upload a picture to the “images” folder in your bucket.
Then check the CloudWatch log group for your function; you should see a log similar to the one below:

[Image: CloudWatch log output]

Implementing the Object Detection Logic
In the Code source section, double-click the lambda_function.py file, and replace the code with the following:

import boto3, urllib.parse

rekognition = boto3.client('rekognition', 'us-west-2')
table = boto3.resource('dynamodb').Table('images')

def detect_labels(bucket, key):
    response = rekognition.detect_labels(
        Image={"S3Object": {"Bucket": bucket, "Name": key}},
        MaxLabels=10,
        MinConfidence=80,
    )

    labels = [label_prediction['Name'] for label_prediction in response['Labels']]

    table.put_item(Item={
        'PK': key,
        'Labels': labels,
    })

    return response


def lambda_handler(event, context):
    data = event['Records'][0]['s3']
    bucket = data['bucket']['name']
    key = urllib.parse.unquote_plus(data['object']['key'])
    try:
        response = detect_labels(bucket, key)
        print(response)
        return response
    except Exception as e:
        print(e)
        raise e

The code outlined above facilitates the following functionalities:

  • Each image uploaded triggers the creation of a new DynamoDB item, with the S3 Object Key serving as the primary key.
  • The item includes a list of corresponding labels retrieved from Amazon Rekognition, stored as a set of strings in DynamoDB.
  • Storing labels in DynamoDB enables repeated retrieval without additional queries to Amazon Rekognition.
  • Labels can be retrieved either by their primary key or by scanning the DynamoDB table and filtering for specific labels using a CONTAINS DynamoDB filter (see the sketch after this list).
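
A sketch of such a scan with a CONTAINS filter is shown below using boto3; the label value is illustrative.

import boto3
from boto3.dynamodb.conditions import Attr

table = boto3.resource('dynamodb').Table('images')

# Return every image whose stored label set contains the given label
response = table.scan(FilterExpression=Attr('Labels').contains('Person'))
for item in response['Items']:
    print(item['PK'], item['Labels'])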

You can now test the function by uploading one or more images to the “images” folder in your bucket.

Go to the DynamoDB table and click “Explore items” in the left pane; you will find the items with the labels recognized by Amazon Rekognition.

[Image: DynamoDB items with recognized labels]

Conclusion
In this tutorial, you set up an Amazon S3 bucket and configured an AWS Lambda function to trigger when images are uploaded to the bucket. You implemented the Lambda function to call the Amazon Rekognition API, label the images, and store the results in Amazon DynamoDB. Finally, you demonstrated how to search DynamoDB for image labels.

This serverless setup offers flexibility, allowing for customization of the Lambda function to address more complex scenarios with minimal effort.

My Personal Serverless Rust Developer Experience. It’s Better Than You Think


One of the things that can be difficult when starting with a new technology, framework or tool is where to get started. That “get started” can mean a great many things to many people. Over the past 6 months or so, I’ve been learning and deploying Rust into production in AWS. I’ve gone back and forth on my workflow and wanted to put together a Serverless Rust Developer Experience article. As you begin with Rust and Serverless, this should give you some good places to get started.

Serverless Rust Developer Experience

Where does it Start

Let’s pretend for a moment that I receive a new feature request from my product owner. It’ll start something like this.

“We need to build a capability that when a customer clicks the ‘z’ button, we calculate the value of the input fields and return them an answer. Can we do that?”

The answer is, of course, YES, I can make this happen. To lead into this article, I’m going to reach for Lambda, Rust and Serverless.

So I want to build a Lambda that handles a user web request. What kinds of tools and patterns do I personally use to accomplish this task?

Developer Experience

The topic of developer experience is highly subjective. However, I tend to group what it’s like to perform the following activities during the delivery process.

  • Writing the code
    • Which IDE
    • Project organization
  • Build and debug process
  • Testing locally with close-to-real scenarios
  • Deploying the bundle which could be with Docker, binaries and bundled code
  • Observability falls here too but I addressed that here

Writing the Code

There are two important things that I’ve had to settle on in this area. I’m one of those developers that settles in ONCE I get comfortable. But if I’m not comfortable, I’m always looking for that nice comfortable spot.

Which IDE

I’m on the record of loving the VSCode experience with Rust. And I do think that it’s amazing that a “non-IDE” can feel so much like an IDE. However, I’ve recently pivoted off of that stance. I know it’s still in EAP, but Rust Rover gives me all of the things that I get from VSCode plus an easier integration with LLDB.

Back to the whole comfort thing. When I find a theme or a look that I like, I tend to use it everywhere. One Dark is the theme that applies across all of my Jet Brains IDEs, VSCode and iTerm.

[Image: Rust Rover developer experience]

I’m a strong believer in knowing your tools so find what works and try and stick with it so that you become a master of its features.

Project Organization

When crafting a solid Serverless Rust Developer Experience, the layout of the project matters to me. What I’ve come to settle on is using Cargo’s workspaces to isolate my Lambda source code while also allowing for shared code in separate project crates. Cargo supports binary and library projects so this fits nicely in with that setup.

When working with Cargo, Cargo Lambda and CDK I like to break my Lambda projects like this:

  • Directory for each Lambda function
  • Directory that holds the shared code library
  • Directory for infra which is the CDK code
  • One final directory for test events

[Image: project directory layout]

A sample Cargo.toml that accomplishes this at the root project level might look like this.

[workspace]
members = [
    "lambda-one",
    "lambda-two"
]

Build and Debug Process

Without a solid build and debug experience, achieving a quality Serverless Rust Developer Experience would be next to impossible. For the next two sections of my setup, I leverage Cargo Lambda pretty hard. Cargo Lambda is a project that brings a subcommand into the Cargo ecosystem for building and testing Lambdas locally. I could also use it for deploying, but I stick to CDK for that.

Building

To build either one or many Lambda functions, I simply issue this command in the root of the project directory.

cargo lambda build

One of the nice things about Cargo Lambda is that it supports cross-compilation. If I want to build for Graviton, I can run.

cargo lambda build --arm64

And finally, if I want to package for release.

cargo lambda build --arm64 --release

Debugging

Now what would the Serverless Rust Developer Experience be without local debugging?

I take two different approaches to debugging code locally.

Path one is to use tracing statements to emit logs so that I can view whatever it is that I want. I find this useful in most cases because I find that I don’t always use an interactive debugger unless something is going wrong.

use aws_lambda_events::event::sqs::SqsEvent;
use lambda_runtime::{Error, LambdaEvent};
use tracing::info;

async fn function_handler(event: LambdaEvent<SqsEvent>) -> Result<(), Error> {
    // Extract some useful information from the request
    info!("(Event)={:?}", event.payload);
    Ok(())
}

Path two is to leverage the interactive debugger. Rust has support for LLDB which integrates nicely into Rust Rover. From there, I can attach Rust Rover to the cargo lambda watch that I have running and I get interactive debugging.

[Image: Serverless Rust developer experience debugging]

With Lambda development I started to get used to not having solid interactive debugging but the experience is improving and has been for quite some time. I don’t always run the interactive debugger, but when I need it, I’m glad I have it.

Testing Locally

I’ve said it so many times recently, Cargo Lambda is the way to go when building Lambda with Rust. The Serverless Rust Developer Experience is greatly enhanced by this subcommand.

I use the local tooling quite a bit in the following way.

The first thing is to fire up the watcher. If you are familiar with nodemon or something similar, watching code is going to seem familiar. I honestly don’t do this with compiled languages much. I’m not sure why not, but since Cargo Lambda uses it as part of the process, I’m happy to follow along.

[Image: starting the watcher]

With the watcher running, I’m going to run a sample event through my code.

[Image: sample event]

And then side-by-side they look like this.

[Image: watcher and sample event side-by-side]

Cargo Lambda supports using templated events, custom event files or even passing data as an ASCII string. I’ve got a lot of options for how I want to exercise my code locally with various event payloads.

Deploying the Bundle

I shared this tweet a bit ago and I am 100% settled on CDK for building and shipping Rust Lambdas. Cargo Lambda also has a nice CDK Construct that wraps some of the cross-compilation pieces as well as how to source the project files.

This is a simple example, but my TypeScript code just creates a new Function. The RustFunction construct inherits from the LambdaFunction which allows me to set things like environment variables or the architecture runtime.

new RustFunction(scope, 'LambdaOne', {
    manifestPath: './lambda-one'
})

While Cargo Lambda does have a way to deploy your stack, which I like, I find that CDK makes local-to-cloud deploys seamless. And then that same code can be used as part of a bigger CDK Pipeline if that’s what I require.

Additional Thoughts

A Serverless Rust Developer Experience can take on many shapes and is often a personal thing. However, the below will be pretty consistent throughout.

  • Writing code
  • Building code
  • Testing code
  • Deploying code

Other things to consider that I didn’t mention.

AI code assistants have become super popular lately. I’m an AWS first person, so I tend to stay in their ecosystem. I’m OK with that bias too. With that said, CodeWhisperer has been a dream for me to work with. It does well in VSCode and with Rust Rover.

The Serverless Application Model (SAM) is another approach to developing Lambdas in Rust. It also provides a solid developer experience. Be advised that you do need to enable beta features, as it uses Cargo Lambda behind the scenes as well. I tend to find too much overlap between Cargo Lambda and SAM, but if you like SAM better than CDK, you’ll be just fine.

Finally, I didn’t mention source code control. That topic is very personal to people. I don’t tend to use my IDE for managing Git. I like to use something external that gives me a “best-in-breed” solution. That tool for me is Fork. I’ve shared this tool before, but never in an article. If you are like me and enjoy something visual and easy to work with, Fork fits those requirements.

Wrapping Up

Getting started with something new can sometimes be hard, scary or even just confusing. My aim with this article was to show you how I build and ship production-grade Lambdas with Rust. The Serverless Rust Developer Experience is world-class at this point and it will only keep improving. This isn’t the only way by any means, but it will give you a solid starting point from which you can experiment and build your own patterns.

As you start new projects, if you take into account the things I’ve shared above, you’ll be in a better starting spot than I was when I got going some months back. And as you learn and get better with Rust and Lambda, I’d love to see how we can all make this even better.

As always, thanks for reading and happy building!

[20 Days of DynamoDB] Day 14 – Using the DynamoDB expression package to build Key Condition and Filter expressions


Posted: 30/Jan/2024

You can use the expression package in the AWS Go SDK for DynamoDB to programmatically build key condition and filter expressions and use them with the Query API.

Here is an example that queries the Thread table for a specific forum (ForumName is the partition key) and filters the results on the number of views:

    keyConditionBuilder := expression.Key("ForumName").Equal(expression.Value("Amazon DynamoDB"))
    filterExpressionBuilder := expression.Name("Views").GreaterThanEqual(expression.Value(3))

    expr, _ := expression.NewBuilder().
        WithKeyCondition(keyConditionBuilder).
        WithFilter(filterExpressionBuilder).
        Build()

    _, err := client.Query(context.Background(), &dynamodb.QueryInput{
        TableName:                 aws.String("Thread"),
        KeyConditionExpression:    expr.KeyCondition(),
        FilterExpression:          expr.Filter(),
        ExpressionAttributeNames:  expr.Names(),
        ExpressionAttributeValues: expr.Values(),
    })

Recommended reading – Key and NameBuilder in the package API docs

[20 Days of DynamoDB] Day 6 – Atomic counters with UpdateItem


Need to implement an atomic counter using DynamoDB? If you have a use case that can tolerate over-counting or under-counting (for example, a visitor count), use the UpdateItem API.

Here is an example that uses the SET operator in an update expression to increment the num_logins attribute:

    resp, err := client.UpdateItem(context.Background(), &dynamodb.UpdateItemInput{
        TableName: aws.String(tableName),
        Key: map[string]types.AttributeValue{
            "email": &types.AttributeValueMemberS{Value: email},
        },
        UpdateExpression: aws.String("SET num_logins = num_logins + :num"),
        ExpressionAttributeValues: map[string]types.AttributeValue{
            ":num": &types.AttributeValueMemberN{
                Value: num,
            },
        },
        ReturnConsumedCapacity: types.ReturnConsumedCapacityTotal,
    })

Note that every invocation of UpdateItem will increment (or decrement) the counter, hence it is not idempotent.

Recommended reading – Atomic Counters

DaisyUI + Alpine.js + Codehooks.io – the simple web app trio


Hello and welcome to this hands-on tutorial where we’ll create a simple and interactive web app using DaisyUI, Alpine.js and Codehooks.io.

This guide is tailored for front-end developers looking to explore the smooth integration of DaisyUI’s stylish components, Alpine.js’s minimalist reactive framework, and the straightforward back-end capabilities of Codehooks.io.

Whether you’re an experienced developer or just starting out, this tutorial will walk you through the essential steps of web app development and deployment.

The finished result of this Web App is shown in the screenshot below.

[Image: Buzzword generator screenshot]
Check out a live example here

What you will learn from this tutorial

DaisyUI

DaisyUI is a component library for Tailwind CSS, offering a range of pre-styled components for rapid UI development. It’s perfect for developers who want to create elegant interfaces quickly without sacrificing the customizability that Tailwind provides.

Alpine.js

Alpine.js is a minimalist JavaScript framework for adding interactive elements to your HTML. It’s ideal for developers who need a lightweight, straightforward tool to enhance their website’s UI with dynamic features like dropdowns and modals.

Codehooks.io

Codehooks.io is a serverless platform that simplifies backend development. It allows you to easily deploy serverless functions and APIs, focusing on your code while it handles the infrastructure, scaling, and maintenance.

OK, let’s learn how to build the Web App.

Project Setup

We need to create a project directory for the source code files and initialize npm.

mkdir myproject && cd myproject
mkdir webapp
npm init -y
npm install codehooks-js

Later we’ll edit the package.json and add the deployment command for codehooks.io.

Now, create the Web App source files. In the root directory, create the two files index.js and buzzwords.js for the server-side code.

In the ‘webapp’ directory, create the two files for the client-side code, index.html and main.js. The touch command is a handy helper.

touch index.js buzzwords.js webapp/index.html webapp/main.js

Codehooks.io Account Setup

Next you’ll need to connect the local project directory to your Codehooks.io project space.

Sign up for an account at Codehooks.io, and create a new project, e.g. mywebapp.

Install the Codehooks.io CLI.

npm install -g codehooks

Login to your account.

coho login

Connect your local web app project with your account/project.

coho init --empty

Use the up/down keys to pick your project and database environment (default dev) from the list. The example project list from my personal account is shown below, where I pick mywebapp-okuc as the active project to host my Web App.

cohodev init
? Select project 
  customers-chsb 
  myapi-4hvc 
❯ mywebapp-okuc 
  mongomirror-g4g9

Select your particular project and press Enter; this creates a config file (config-dev.json) in your project directory with the unique project name.

Finally, your project directory should look like this after running the above commands successfully.

.
├── buzzwords.js
├── config-dev.json
├── index.js
├── package-lock.json
├── package.json
└── webapp
    ├── index.html
    └── main.js

This concludes the (boring) setup tasks for our project; let’s move on to the fun part of developing the frontend and the backend.

Developing the Frontend (index.html)

The web app frontend is a really simple HTML page that uses DaisyUI for the looks and Alpine.js for the reactive dynamics.

In the head section we include CDN links for Tailwind CSS, DaisyUI, and Alpine.js.


<!DOCTYPE html>
<html lang="en">

<head>
  <meta name="viewport" content="width=device-width, initial-scale=1.0">
  <!-- Tailwind CSS and DaisyUI from CDN -->
  <link href="https://cdn.jsdelivr.net/npm/tailwindcss@^2.2.19/dist/tailwind.min.css" rel="stylesheet">
  <link href="https://cdn.jsdelivr.net/npm/daisyui@^1.3.6/dist/full.css" rel="stylesheet">
  <!-- Alpine.js and the client-side script; these tags were stripped from the original listing and are reconstructed here -->
  <script defer src="https://cdn.jsdelivr.net/npm/alpinejs@3.x.x/dist/cdn.min.js"></script>
  <script defer src="main.js"></script>
  <link href="https://dev.to/dev/" />
</head>

<body>
  <div class="container mx-auto p-8">
    <h1 class="text-4xl font-bold text-center">Buzzword generator!</h1>
    <div x-data="messageStore" class="flex justify-center">
      <div class="card w-96 bg-base-100 shadow-xl">
        <div class="card-body">
          <h2 class="card-title">Buzz count <span class="badge badge-lg" x-text="bsCount"></span></h2>
          <div class="toast toast-top toast-center">
            <div class="alert alert-info"><span x-text="message"></span></div>
          </div>
          <div class="card-actions justify-end">
            <button @click="getMessage()" class="btn btn-primary">Get Buzzwords</button>
          </div>
        </div>
      </div>
    </div>
  </div>
</body>

</html>

Backend API Integration (main.js)

We’ll use Alpine.js’s powerful x-data directive to handle dynamic content and reactive binding. The client calls the Codehooks.io server REST API, /api/message, which returns a dynamic JSON object as the response to the client request.

The client side code webapp/main.js is shown below.


document.addEventListener('alpine:init', () => {
  Alpine.data('messageStore', () => ({
    message: 'Click the button to fetch a message ...',
    bsCount: 0,
    // init is called when document is ready
    init() {      
      this.getMessage()
    },
    // fetch json from the server public api
    async getMessage() {
      try {
        const response = await fetch('api/message', {
          headers: {
            'Content-Type': 'application/json'
          }
        });
        if (!response.ok) {
          throw new Error(`HTTP error! status: ${response.status}`);
        }
        // destruct json from server
        const { message, bsCount } = await response.json();
        // update local Alpine.data model
        this.message = message;
        this.bsCount = bsCount;
      } catch (error) {
        console.error('Error:', error);
      }
    }
  }));
});

The next section shows how to create the server side for the REST API.

Backend Server Setup (index.js)

The server code is a simple Codehooks.io backend that exposes a single public route, GET /api/message, and serves the web app files from the webapp sub-directory. The random buzzwords are generated by combining three arrays of words imported from the buzzwords.js file.

Note that we also use the built-in key-value datastore to increment the total count of generated buzzwords.

The complete server code is shown below.


/*
* Codehooks.io backend example app
*/
import {app, datastore} from 'codehooks-js'
import { buzzwords1, buzzwords2, buzzwords3 } from './buzzwords';


function generateBS() {
  const word1 = buzzwords1[Math.floor(Math.random() * buzzwords1.length)];
  const word2 = buzzwords2[Math.floor(Math.random() * buzzwords2.length)];
  const word3 = buzzwords3[Math.floor(Math.random() * buzzwords3.length)];

  return `${word1} ${word2} ${word3}`;
}

// allow api access without api token
app.auth('/api/message*', (req, res, next) => {
  if (req.method === 'GET') {
    next();
  } else {
    next('Only GET is allow public access');
  }
})

// api route for generating a buzzword message and a count
app.get('/api/message', async (req, res) => {
  const db = await datastore.open();
  const bsCount = await db.incr('bsCount', 1)
  console.log('Count', bsCount)
  res.json({bsCount, message: generateBS()})
})

// serve the webapp directory on the / route
app.static({route: "/", directory: "/webapp"})

// bind to serverless runtime
export default app.init();

In this simple example we combine the arrays of words from the buzzwords.js file shown below.

export const buzzwords1 = [
    "Blockchain", "Quantum", "Synergistic", "Big Data", ...
];

export const buzzwords2 = [
    "driven", "enhanced", "optimized", "integrated", ...
];

export const buzzwords3 = [
    "solutions", "platforms", "paradigms", "algorithms", ...
];

Deployment Using Codehooks CLI

We’ll use the Codehooks CLI for deployment. And to automate things we create script deploy commands in our package.json as shown below.

{
  "name": "mywebapp",
  "version": "1.0.0",
  "description": "My awesome webapp",
  "main": "index.js",
  "scripts": {
    "predeploy": "codehooks upload ./webapp",
    "deploy": "codehooks deploy"
  },
  "dependencies": {
    "codehooks-js": "^1.1.10"
  }
}

The deploy script first uploads all files in the webapp directory (the predeploy step) and then deploys your backend server (index.js) to the serverless cloud.

To deploy the complete web app and the backend API run:

npm run deploy

Testing and Domain Configuration

You can access your deployed app and verify its functionality in several ways.

  • Run the coho info command to find your project URL, in my example project the URL is: https://bsgenerator-kge3.api.codehooks.io/dev, copy this into your browser and test the Web App
  • You can also add a custom domain (e.g. example.com) that you control by adding an A-Record or C-Record and point it to Codehooks. This generates a free SSL certificate with auto renew. (Check how to setup this in your admin settings in the Codehooks Studio App)
  • Test the server REST API with Postman or cURL by calling the canonical URL for the API, e.g. curl --location 'https://bsgenerator-kge3.api.codehooks.io/dev/api/message'
  • Coming soon! Use the built-in random domain for development; in my example app the public domain is: mighty-frog-f100.codehooks.io.

Concluding Thoughts

The tutorial aims to demonstrate the simplicity and efficiency of combining DaisyUI, Alpine.js, and Codehooks.io in web development, guiding through each stage from setup to deployment. The final product is a dynamic web application capable of generating random messages, showcasing the power of these tools in creating responsive and interactive web applications.

Integration Testing Vercel Serverless Functions with OpenTelemetry

Today you’ll learn how to create production-ready serverless functions using Vercel. It will include adding distributed tracing with…

The post Integration Testing Vercel Serverless Functions with OpenTelemetry appeared first on ProdSens.live.

]]>
integration-testing-vercel-serverless-functions-with-opentelemetry

Today you’ll learn how to create production-ready serverless functions using Vercel. It will include adding distributed tracing with OpenTelemetry for troubleshooting and integration testing with Tracetest for trace-based testing.

Once you’re done reading, you’ll have a complete boilerplate project to use for your own Vercel serverless functions interacting with Vercel Postgres as storage. More importantly, I’ll explain how to locally test your serverless functions in a typical development lifecycle and create integration tests for CI pipelines packaged in Docker.

If you can’t wait until the end, clone the example from GitHub, get your API keys and tokens after signing up at [app.tracetest.io](https://app.tracetest.io), and create a Vercel Postgres instance. Add the Vercel Postgres env vars to your .env files and spin up the tests with Docker. Make sure to have Docker and Docker Compose installed for the quick start!

git clone git@github.com:kubeshop/tracetest.git
cd tracetest/examples/integration-testing-vercel-functions
docker compose up -d --build
docker compose run integration-tests

https://res.cloudinary.com/djwdcmwdz/image/upload/v1704656325/Blogposts/integration-testing-vercel-functions/ezgif.com-animated-gif-maker_3_tpetvk.gif

Why Serverless Functions?

Serverless functions are server-side code that runs in a cloud provider’s environment without you having to provision or manage servers. They are executed on demand, eliminating traditional infrastructure work.

Platforms like Vercel that host serverless functions and front-end code offer developers scalability and flexibility with no infrastructure overhead.

Observability for Serverless Architecture

Serverless architectures often struggle with visibility. However, observability tools can help. They trace events from start to finish, collect metrics, and evaluate how systems manage these events.

OpenTelemetry, an open-source observability framework, is one such tool. It helps gather, process, and export data like traces, metrics, and logs. Traces are especially useful as they provide insights into how distributed systems perform by tracing requests across various services.

Luckily for us, Vercel has built-in support for OpenTelemetry! 🔥

Testing Serverless Architecture

How does this tie into testing? Running tests against serverless functions can be quite tiresome since you can only run black-box tests.

But, what if you can use distributed tracing for testing as well? Now you can. Using OpenTelemetry and Tracetest in your development lifecycle enables both local testing and integration testing in your CI pipelines. Let’s jump in and you’ll learn exactly how to do it yourself!

Vercel Function Architecture

Today you’ll build a Vercel function that imports a Pokemon into a Pokedex!

The function will fetch data from an external API, transform the data and insert it into a Vercel Postgres database. This particular flow has two failure points that are difficult to test.

  1. Validating that an external API request from a Vercel function is successful.
  2. Validating that a Postgres insert request is successful.

https://res.cloudinary.com/djwdcmwdz/image/upload/v1704655783/Blogposts/integration-testing-vercel-functions/vercel-func-arch_lnvjj0.png

Install Vercel and Create a Boilerplate

Deployment and development are managed with the Vercel Command Line Interface (CLI), and I’ll be using Next.js to simplify the setup and development of serverless functions.

The initial step involves creating a Vercel account, installing the CLI, and authenticating through the CLI.

npm i -g vercel@latest
vercel login

Next create the Next.js boilerplate.

vercel init nextjs

Rename the nextjs directory to integration-testing-vercel-functions. Now you’re ready to deploy it to Vercel.

vercel deploy

This will create a project in the Vercel Dashboard.

Create a Vercel Postgres Database

Go to the Storage tab in your Vercel account and create a Postgres database. Give it a name. I’ll use pokedex since I’ll show how to catch some Pokemon!

https://res.cloudinary.com/djwdcmwdz/image/upload/v1704701080/Blogposts/integration-testing-vercel-functions/Screenshot_2024-01-07_at_10.28.42_usdwmn.png

Use psql to connect to the database.

psql "postgres://default:************@ep-morning-wood-76425109.us-east-1.postgres.vercel-storage.com:5432/verceldb

And, create a table.

CREATE TABLE IF NOT EXISTS pokemon (
  id SERIAL PRIMARY KEY,
  name VARCHAR(255) NOT NULL,
  "createdAt" TIMESTAMP WITH TIME ZONE DEFAULT CURRENT_TIMESTAMP
);

Proceed to connect the database to your project and pull the environment variables to your local development.

vercel link
vercel env pull .env.development.local

This will create a .env.development.local file in your local directory that contains all the Postgres connection details.

# Vercel Postgres
POSTGRES_DATABASE="**********"
POSTGRES_HOST="**********"
POSTGRES_PASSWORD="**********"
POSTGRES_PRISMA_URL="**********"
POSTGRES_URL="**********"
POSTGRES_URL_NON_POOLING="**********"
POSTGRES_USER="**********"

# ...

https://res.cloudinary.com/djwdcmwdz/image/upload/v1704701081/Blogposts/integration-testing-vercel-functions/Screenshot_2024-01-07_at_10.39.35_r2ze6w.png

Finally, install the Vercel Postgres SDK to interact with the database from code.

npm install @vercel/postgres

You’re ready to start building!

Create a Serverless Function

You’ll create an import flow: send the ID of a Pokemon to the serverless function, and it fetches the Pokemon’s info from an external API and stores it in the pokedex database.

In the root directory create a /pages directory with an /api directory inside of it, and create pokemon.ts there.

Your structure should look like this.

/pages
  /api
    /pokemon.ts

Every file within the /api directory maps to a specific API route. The function you’ll create will be accessible at the URL [http://localhost:3000/api/pokemon](http://localhost:3000/api/pokemon).

Each function takes a request as input and is expected to return a response. Failing to return a response will result in a timeout.

To return JSON data, you’ll use the res.status(200).json({...}) method. Async/Await flows are enabled by default as well!

Here’s an example handler for a POST request that makes a GET request to an external API from within the serverless function and then inserts the data into Postgres. It’s a common point of failure that is hard to troubleshoot and test.

import type { NextApiRequest, NextApiResponse } from 'next'
import { sql } from '@vercel/postgres'

export async function addPokemon(pokemon: any) {
  return await sql`
    INSERT INTO pokemon (name)
    VALUES (${pokemon.name})
    RETURNING *;
  `
}

export async function getPokemon(pokemon: any) {
  return await sql`
    SELECT * FROM pokemon where id=${pokemon.id};
  `
}

export default async function handler(
  req: NextApiRequest,
  res: NextApiResponse
) {
  try {
    const requestUrl = `https://pokeapi.co/api/v2/pokemon/${req.body.id || '6'}`
    const response = await fetch(requestUrl)
    const resPokemon = await response.json()

    const { rowCount, rows: [addedPokemon, ...addedPokemonRest] } = await addPokemon(resPokemon) 
    res.status(200).json(addedPokemon)

  } catch (err) {
    res.status(500).json({ error: 'failed to load data' })
  }
}

Go ahead and run the function.

npm run dev

Navigate to http://localhost:3000/api/pokemon to see the response from your serverless function.

{"id":13,"name":"charizard","createdAt":"2024-01-07T13:56:12.379Z"}

Configure Troubleshooting with OpenTelemetry and Distributed Tracing

OpenTelemetry libraries for Node.js are stable and include auto-instrumentation. I’d like to say it’s automagical since it gives you distributed tracing out of the box by just adding a few modules and a single preloaded JavaScript file.

Note: To learn more check out the official Vercel docs about OpenTelemetry or take a look at the official OpenTelemetry docs.

Firstly you need to install OpenTelemetry packages:

npm install \
    @opentelemetry/sdk-node \
    @opentelemetry/resources \
    @opentelemetry/semantic-conventions \
    @opentelemetry/sdk-trace-node \
    @opentelemetry/exporter-trace-otlp-http \
    @opentelemetry/exporter-trace-otlp-grpc \
    @opentelemetry/api \
    @opentelemetry/auto-instrumentations-node \
    @opentelemetry/instrumentation-fetch

Note: OpenTelemetry APIs are not compatible with the edge runtime, so you need to make sure that you are importing them only when process.env.NEXT_RUNTIME === 'nodejs'. The official Vercel docs recommend creating a new file instrumentation.node.ts to import only when using Node.

Start by enabling OpenTelemetry instrumentation in the Next.js app by setting the experimental.instrumentationHook: true in the next.config.js like below.

/** @type {import('next').NextConfig} */
const nextConfig = {
  experimental: {
    instrumentationHook: true,
  },
    // ...
}

module.exports = nextConfig

Then, create a file called instrumentation.ts to initialize the NodeSDK for OpenTelemetry in your serverless functions.

// instrumentation.ts

export async function register() {
  if (process.env.NEXT_RUNTIME === 'nodejs') {
    await import('./instrumentation.node')
  }
}

Since you need a custom configuration and a dedicated file for the Node.js runtime, create another file called instrumentation.node.ts and paste this code into it.

import { NodeSDK } from '@opentelemetry/sdk-node'
import { OTLPTraceExporter } from '@opentelemetry/exporter-trace-otlp-http'
import { Resource } from '@opentelemetry/resources'
import { SemanticResourceAttributes } from '@opentelemetry/semantic-conventions'
import { getNodeAutoInstrumentations } from '@opentelemetry/auto-instrumentations-node'
import { FetchInstrumentation } from '@opentelemetry/instrumentation-fetch'

const sdk = new NodeSDK({
  // The OTEL_EXPORTER_OTLP_ENDPOINT env var is passed into "new OTLPTraceExporter" automatically.
  // If the OTEL_EXPORTER_OTLP_ENDPOINT env var is not set the "new OTLPTraceExporter" will
  // default to use "http://localhost:4317" for gRPC and "http://localhost:4318" for HTTP.
  // This sample is using HTTP.
  traceExporter: new OTLPTraceExporter(),
  instrumentations: [
    getNodeAutoInstrumentations(),
    new FetchInstrumentation(),
  ],
  resource: new Resource({
    [SemanticResourceAttributes.SERVICE_NAME]: 'integration-testing-vercel-functions',
  }),
})
sdk.start()

This will configure the OpenTelemetry libraries and trace exporters. The OTLPTraceExporter will look for an environment variable called OTEL_EXPORTER_OTLP_ENDPOINT. If not found it’ll default to using [http://localhost:4318](http://localhost:4318) for HTTP traffic. Add it to your .env.development.local file.

# OTEL
OTEL_EXPORTER_OTLP_ENDPOINT="http://localhost:4318"
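
If you want to sanity-check locally that spans are actually being produced before pointing the exporter anywhere, one option (my own suggestion, not part of the article's setup) is to temporarily swap the OTLP exporter for a console exporter in instrumentation.node.ts:

// Temporary debugging variant of instrumentation.node.ts (hypothetical)
import { NodeSDK } from '@opentelemetry/sdk-node'
import { ConsoleSpanExporter } from '@opentelemetry/sdk-trace-node'
import { getNodeAutoInstrumentations } from '@opentelemetry/auto-instrumentations-node'

const sdk = new NodeSDK({
  // Prints every finished span to stdout instead of exporting it over OTLP
  traceExporter: new ConsoleSpanExporter(),
  instrumentations: [getNodeAutoInstrumentations()],
})
sdk.start()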

Now, you need to add a tracer to the Vercel function. Replace the code in your pages/api/pokemon.ts with this.

import { trace, SpanStatusCode } from '@opentelemetry/api'
import type { NextApiRequest, NextApiResponse } from 'next'
import { sql } from '@vercel/postgres'

export async function addPokemon(pokemon: any) {
  return await sql`
    INSERT INTO pokemon (name)
    VALUES (${pokemon.name})
    RETURNING *;
  `
}

export async function getPokemon(pokemon: any) {
  return await sql`
    SELECT * FROM pokemon where id=${pokemon.id};
  `
}

export default async function handler(
  req: NextApiRequest,
  res: NextApiResponse
) {
  const activeSpan = trace.getActiveSpan()
  const tracer = trace.getTracer('integration-testing-vercel-functions')

  try {

    const externalPokemon = await tracer.startActiveSpan('GET Pokemon from pokeapi.co', async (externalPokemonSpan) => {
      const requestUrl = `https://pokeapi.co/api/v2/pokemon/${req.body.id || '6'}`
      const response = await fetch(requestUrl)
      const { id, name } = await response.json()

      externalPokemonSpan.setStatus({ code: SpanStatusCode.OK, message: String("Pokemon fetched successfully!") })
      externalPokemonSpan.setAttribute('pokemon.name', name)
      externalPokemonSpan.setAttribute('pokemon.id', id)
      externalPokemonSpan.end()

      return { id, name }
    })

    const addedPokemon = await tracer.startActiveSpan('Add Pokemon to Vercel Postgres', async (addedPokemonSpan) => {
      const { rowCount, rows: [addedPokemon, ...rest] } = await addPokemon(externalPokemon)
      addedPokemonSpan.setAttribute('pokemon.isAdded', rowCount === 1)
      addedPokemonSpan.setAttribute('pokemon.added.name', addedPokemon.name)
      addedPokemonSpan.end()
      return addedPokemon
    })

    res.status(200).json(addedPokemon)

  } catch (err) {
    activeSpan?.setAttribute('error', String(err))
    activeSpan?.recordException(String(err))
    activeSpan?.setStatus({ code: SpanStatusCode.ERROR, message: String(err) })
    res.status(500).json({ error: 'failed to load data' })
  } finally {
    activeSpan?.end()
  }
}

What is going on here?

  1. You first import the @vercel/postgres SDK and the OpenTelemetry API.
  2. Then, you define two functions for interacting with Vercel Postgres, addPokemon and getPokemon.
  3. You get the activeSpan and instantiate a tracer with trace.getTracer. The active span in this context is the span from the Vercel function itself.
  4. It uses tracer.startActiveSpan to wrap the external HTTP request and the Vercel Postgres database call. This generates a trace span that any child spans attach to. The important part is making sure the fetch request’s trace spans are attached to the correct parent span, which happens automatically with tracer.startActiveSpan.
  5. It uses span.setStatus and span.setAttribute to attach values to the span itself.
  6. It uses span.recordException(String(err)) in the exception handler to make sure an exception is recorded if it happens.
  7. Finally, pun intended, it ends the span with the span.end method.
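
Points 4 through 7 above boil down to one reusable pattern: wrap the unit of work in tracer.startActiveSpan, record errors on the span, and always end it. Below is a minimal, generic sketch of that pattern; the traced() helper is my own illustration and not part of the example repo:

import { trace, SpanStatusCode } from '@opentelemetry/api'

const tracer = trace.getTracer('integration-testing-vercel-functions')

// Generic wrapper: creates a child span around any async unit of work (hypothetical helper)
async function traced<T>(name: string, work: () => Promise<T>): Promise<T> {
  return tracer.startActiveSpan(name, async (span) => {
    try {
      const result = await work()
      span.setStatus({ code: SpanStatusCode.OK })
      return result
    } catch (err) {
      span.recordException(String(err))
      span.setStatus({ code: SpanStatusCode.ERROR, message: String(err) })
      throw err
    } finally {
      // Ending the span in finally guarantees it closes on both success and failure
      span.end()
    }
  })
}

Anything awaited inside work() that is auto-instrumented, such as the fetch call, is attached as a child of this span automatically.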

With this, your Next.js app will emit distributed traces! But, you’re not sending them anywhere. Let’s fix that by adding Tracetest to the development lifecycle.

The Modern Way of Testing Serverless Functions Locally

Spin up your Next.js app.

npm run dev

This starts the server and function on http://localhost:3000/api/pokemon.

Tracetest is a trace-based testing tool for building integration tests in minutes using OpenTelemetry traces. Create test specs against trace data at every point of a request transaction. It’s open source and has a managed cloud offering as well.

Since it’s easier to get started with no dependencies, I’ll show how to use Tracetest instead of Tracetest Core. But, you can still get the same functionality with Tracetest Core!

To get started with Tracetest:

  1. You’ll need to download the CLI for your operating system. A sample for macOS is below.
brew install kubeshop/tracetest/tracetest
  2. Sign up for an account. Go ahead and do that now.

The CLI is bundled with Tracetest Agent, which runs in your infrastructure to collect responses and traces for tests. Learn more in the Tracetest docs.

To start Tracetest Agent, add the --api-key flag manually at startup, or just run tracetest start and pick your environment from the menu.

You can find the organization and environment values in the Settings > Agent in the Tracetest app.

Now, go ahead and start Tracetest Agent.

tracetest start --api-key 

https://res.cloudinary.com/djwdcmwdz/image/upload/v1704636693/Blogposts/integration-testing-vercel-functions/screely-1704636677545_ovv44w.png

With the Tracetest Agent started, go back to [app.tracetest.io](http://app.tracetest.io) and trigger the serverless function.

https://res.cloudinary.com/djwdcmwdz/image/upload/v1704636858/Blogposts/integration-testing-vercel-functions/screely-1704636842453_dgjt3k.png

Switch to the Trace tab to see the full preview of the distributed trace.

https://res.cloudinary.com/djwdcmwdz/image/upload/v1704654844/Blogposts/integration-testing-vercel-functions/screely-1704654837147_e5jsp8.png

From here you can add test specs to validate that the HTTP requests never fail, including both the downstream API call and the Vercel Postgres database connections.

Click the Test tab and add a test spec from the snippets. Select All HTTP Spans: Status code is 200.

https://res.cloudinary.com/djwdcmwdz/image/upload/v1704654903/Blogposts/integration-testing-vercel-functions/screely-1704654897438_uq1p9a.png

Save the test spec. You now see the test is passing since both the external HTTP requests and database interactions are passing and returning status code 200.

All this is enabled by OpenTelemetry tracing and Tracetest! What’s also awesome is that these tests are stored in your Tracetest account, so you can revisit them and run the same tests again every time you run your dev environment!

This is awesome for API testing in your development lifecycle while building Vercel functions, but also for pre-merge testing and integration testing.

Let me explain how to enable integration testing in CI pipelines next!

Integration Testing Vercel Serverless Functions

Finally, check out the Automate tab.

https://res.cloudinary.com/djwdcmwdz/image/upload/v1704637418/Blogposts/integration-testing-vercel-functions/screely-1704637411996_rp27vc.png

Every test you create can be expressed with YAML. I know you love YAML, quit complaining! 😄

With this test definition you can trigger the same test via the CLI, either locally or in any CI pipeline of your choice.

To try it locally, create a directory called test in the root directory.

Paste this into a file called api.pokemon.spec.development.yaml.

# api.pokemon.spec.development.yaml

type: Test
spec:
  id: kv8C-hOSR
  name: Test API
  trigger:
    type: http
    httpRequest:
      method: POST
      url: http://localhost:3000/api/pokemon
      body: "{n  "id": "6"n}"
      headers:
      - key: Content-Type
        value: application/json
  specs:
  - selector: span[tracetest.span.type="http"]
    name: "All HTTP Spans: Status  code is 200"
    assertions:
    - attr:http.status_code = 200

Since you already have the Tracetest CLI installed, running it is as simple as one command.

tracetest run test -f ./test/api.pokemon.spec.development.yaml --required-gates test-specs --output pretty

[Output]
✔ Test API (https://app.tracetest.io/organizations/ttorg_e66318ba6544b856/environments/ttenv_0e807879e2e38d28/test/-gjd4idIR/run/22/test) - trace id: f2250362ff2f70f8f5be7b2fba74e4b2
    ✔ All HTTP Spans: Status code is 200

What’s cool is you can follow the link and open the particular test in Tracetest and view it once it’s saved in the cloud.

Let’s introduce Docker into the mix and set up a proper CI environment.

Start by creating a .env.docker file.

# OTLP HTTP
OTEL_EXPORTER_OTLP_ENDPOINT="http://tracetest-agent:4318"

# Vercel Postgres
POSTGRES_DATABASE="**********"
POSTGRES_HOST="**********"
POSTGRES_PASSWORD="**********"
POSTGRES_PRISMA_URL="**********"
POSTGRES_URL="**********"
POSTGRES_URL_NON_POOLING="**********"
POSTGRES_USER="**********"

This makes sure the trace exporter endpoint sends traces to the Tracetest Agent container (note the tracetest-agent hostname, which relies on Docker service networking).

Then, create a docker-compose.yaml in the root directory.

version: "3"

services:
  next-app:
    image: foobar/next-app:v1
    build:
      context: .
      dockerfile: ./Dockerfile
    env_file:
      - .env.docker
    restart: always
    ports:
      - 3000:3000
    networks:
      - tracetest

  tracetest-agent:
    image: kubeshop/tracetest-agent:latest
    environment:
      - TRACETEST_API_KEY=ttagent_ # Find the Agent API Key here: https://docs.tracetest.io/configuration/agent
    ports:
      - 4317:4317
      - 4318:4318
    networks:
      - tracetest

  integration-tests:
    image: foobar/integration-tests:v1
    profiles:
      - tests
    build:
      context: ./
      dockerfile: ./test/Dockerfile
    volumes:
      - ./test/:/app/test/
    depends_on:
      tracetest-agent:
        condition: service_started
    networks:
      - tracetest

networks:
  tracetest:

Next, create a Dockerfile for the Next.js app.

FROM node:20-alpine AS base
FROM base AS builder
WORKDIR /app
RUN apk add --no-cache g++ make py3-pip

COPY package.json package-lock.json ./
RUN npm i

COPY app ./app
COPY pages ./pages
COPY public ./public
COPY next.config.js .
COPY tsconfig.json .
COPY instrumentation.ts .
COPY instrumentation.node.ts .

RUN npm run build

# Step 2. Production image, copy all the files and run next
FROM base AS runner
WORKDIR /app
# Don't run production as root
RUN addgroup --system --gid 1001 nodejs
RUN adduser --system --uid 1001 nextjs
USER nextjs

COPY --from=builder /app/public ./public
COPY --from=builder --chown=nextjs:nodejs /app/.next/standalone ./
COPY --from=builder --chown=nextjs:nodejs /app/.next/static ./.next/static
CMD ["node", "server.js"]

Don’t forget to set the next.config.js to include output: standalone.

/** @type {import('next').NextConfig} */
const nextConfig = {
  // ...
  output: 'standalone',
}

module.exports = nextConfig

Go back to the test directory and create a Dockerfile in it. This Dockerfile is for a container running the integration tests. It will include an installation of the Tracetest CLI and contain a bash script called run.bash to run tests.

FROM alpine
WORKDIR /app
RUN apk --update add bash jq curl
RUN curl -L https://raw.githubusercontent.com/kubeshop/tracetest/main/install-cli.sh | bash
WORKDIR /app/test/
ENTRYPOINT ["https://dev.to/bin/bash", "https://dev.to/app/test/run.bash"]

Next, create the run.bash.

#!/bin/bash

# Add a Tracetest token here
# https://docs.tracetest.io/concepts/environment-tokens
tracetest configure -t tttoken_
tracetest run test -f ./api.pokemon.spec.docker.yaml --required-gates test-specs --output pretty
# Add more tests here! :D

Finally, create a api.pokemon.spec.docker.yaml file in the test directory.

# api.pokemon.spec.docker.yaml

type: Test
spec:
  id: p00W82OIR
  name: Test API
  trigger:
    type: http
    httpRequest:
      method: POST
      url: http://next-app:3000/api/pokemon # Note: Using Docker networking!
      body: "{n  "id": "6"n}"
      headers:
      - key: Content-Type
        value: application/json
  specs:
  - selector: span[tracetest.span.type="http"]
    name: "All HTTP Spans: Status code is 200"
    assertions:
    - attr:http.status_code = 200

Now, go ahead and give it a run!

docker compose up -d --build

And, trigger the integration tests.

docker compose run integration-tests

[Output]
[+] Creating 1/0
 ✔ Container integration-testing-vercel-functions-tracetest-agent-1  Running                                                                                             0.0s
 SUCCESS  Successfully configured Tracetest CLI
✔ Test API (https://app.tracetest.io/organizations/ttorg_e66318ba6544b856/environments/ttenv_82af376d61da80a0/test/p00W82OIR/run/8/test) - trace id: d64ab3a6f52a98141d26679fff3373b6
    ✔ All HTTP Spans: Status code is 200

Running this particular Docker Compose stack in any CI pipeline is a breeze. The only dependencies you need are Docker and Docker Compose. It’s self-contained, standalone, and easily transferable to the pipeline of your choice.
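
As an illustration, here is a minimal sketch of what that could look like as a GitHub Actions workflow. The file path and the ENV_DOCKER secret name are assumptions for this example, and any CI runner with Docker and Docker Compose available will work just as well:

# .github/workflows/integration-tests.yaml (hypothetical example)
name: integration-tests
on: [push]

jobs:
  integration-tests:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      # Recreate .env.docker from a repository secret so the containers get
      # the Postgres and OTLP settings described above
      - run: echo "${{ secrets.ENV_DOCKER }}" > .env.docker
      - run: docker compose up -d --build
      - run: docker compose run integration-tests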

Beyond Integration Testing

In conclusion, today you learned a step-by-step approach to developing, troubleshooting, and testing serverless functions using Vercel and Next.js. You started from a boilerplate Next.js project, created a serverless function, and integrated OpenTelemetry for distributed tracing.

Then you added Tracetest for trace-based testing, in both your development workflow and CI pipelines. With these tools at your disposal you can build, test, and optimize serverless functions efficiently for developing robust serverless applications.

If you get stuck along the way, feel free to check out the example app in the GitHub repo.

Stay tuned for the next 2 parts of this series coming soon:

  • Part 2: I’ll dive into end-to-end testing by integrating Cypress with Tracetest.
  • Part 3: You’ll learn how to configure production troubleshooting and testing by using observability tools on the Vercel Marketplace.

Would you like to learn more about Tracetest and what it brings to the table? Visit the Tracetest docs and try it out by downloading it today!

Also, please feel free to join our Slack community, give Tracetest a star on GitHub, or schedule a time to chat 1:1.
