<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: acaciabengo</title><link>https://news.ycombinator.com/user?id=acaciabengo</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 26 Jun 2026 01:55:34 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=acaciabengo" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by acaciabengo in "Launch HN: Transload (YC P26) – Measuring freight items with CCTV"]]></title><description><![CDATA[
<p>Interesting work. Is this something where SAM-3D or it background applies to?</p>
]]></description><pubDate>Wed, 10 Jun 2026 19:42:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=48481586</link><dc:creator>acaciabengo</dc:creator><comments>https://news.ycombinator.com/item?id=48481586</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48481586</guid></item><item><title><![CDATA[Show HN: QuantTakeoff – Construction PDFs to takeoff and 3D scene]]></title><description><![CDATA[
<p>I built QuantTakeoff and releasing v1.0 for validation: Input a construction PDF, get back 
takeoff report with wall lengths, areas, door/window counts and sizes
3D GLB of the building at real-world scale all under ~10 mins.<p>The pain it solves: Reduce time estimator to trace out elements either by hand or software and extract out reports.<p>Stack: Ensemble of computer vision tools (95%), VLM OCR (5%)<p>Hard parts that are working:
Auto-find the plan page in a 200-sheet bid set (most stacks make you point at the right sheet manually). I am still manually annotating a new dataset for the new models.
 Holds up on noisy as-builts and hand-marked sheets that break OCR-first pipelines
Post CV processing to optimize for usability.
Pixel accuracy to measure elements including size / width of doors and windows.<p>Demo: <a href="https://youtu.be/fVy7tDFqR98" rel="nofollow">https://youtu.be/fVy7tDFqR98</a><p>Particularly want feedback on:
Real world estimators feedback on what could be better for the practice.
What would make use this or what is missing?<p>P.S Live demo has been removed to manage compute costs from a few enthusiasts who burn through the HF spaces bill.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48164588">https://news.ycombinator.com/item?id=48164588</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 16 May 2026 23:09:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=48164588</link><dc:creator>acaciabengo</dc:creator><comments>https://news.ycombinator.com/item?id=48164588</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48164588</guid></item><item><title><![CDATA[New comment by acaciabengo in "Ask HN: Who wants to be hired? (May 2026)"]]></title><description><![CDATA[
<p>Location: Kampala, Uganda                                                             
Remote: Yes | Relocate: No<p>Tech: Python, PyTorch, Detectron2, OpenCV, Swin/ViT, BERT, Hugging Face, Ruby on
Rails, FastAPI, PostgreSQL, Airflow, RabbitMQ, Docker, GCP.<p>Email: acaciabengo@gmail.com<p>Resume:<p><a href="https://drive.google.com/file/d/1oOeY0tsJ7Ujx2fk3m5BkSfBhto24sBzw/view?usp=sharing" rel="nofollow">https://drive.google.com/file/d/1oOeY0tsJ7Ujx2fk3m5BkSfBhto2...</a><p>GitHub / HF: acaciabengo<p>ML + Software Engineer, 11+ yrs. MSCS (ML) at Georgia Tech. Ex-founder/CTO of a telecom/fintech platform (60M+ SMS, 5M+ USSD lottery tickets, LTV/churn models that
cut marketing ~50%).<p>Recent:<p>- plan_to_3d — Floor plan PDFs → interactive 3D GLB. Mask R-CNN + Swin-T/FPN in       
Detectron2 for wall/door/window segmentation; Shapely vectorization, trimesh extrusion
  with opening clipping, OCR-based scale, Gradio + Three.js viewer.<p>- NLP distress/moderation models for 13K users; Airflow pipelines on
1M+ msgs/month; Rails backend unifying Twilio/FB/YouTube (220K+ interactions).        
- CensorX — Open-sourced DistilBERT/ViT moderation models on HF (97% / 92% precision).<p>Looking for applied CV / multimodal ML, or senior ML+backend roles where shipping
end-to-end matters.</p>
]]></description><pubDate>Wed, 06 May 2026 21:12:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=48041934</link><dc:creator>acaciabengo</dc:creator><comments>https://news.ycombinator.com/item?id=48041934</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48041934</guid></item><item><title><![CDATA[Ask HN: Advice on Solo Launching]]></title><description><![CDATA[
<p>I have built a pipeline that takes construction documents as PDFs, extracts out the floor plans.<p>It extracts out BIM data from the plans at pixel accuracy producing data on walls, floors, windows and doors accurately.<p>It also generates 3D impressions and a brief 10s 3D video of the floor plans.<p>I have had multiple discussion to build within a company already solving the same problem but not closed a deal yet and considering taking it to the market as a solo founder.<p>I am outside the UK and USA and was seeking a founder for market access and future fundraising.<p>What is the best advice on how to go to market as a sole founder?</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47628282">https://news.ycombinator.com/item?id=47628282</a></p>
<p>Points: 6</p>
<p># Comments: 5</p>
]]></description><pubDate>Fri, 03 Apr 2026 15:56:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=47628282</link><dc:creator>acaciabengo</dc:creator><comments>https://news.ycombinator.com/item?id=47628282</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47628282</guid></item><item><title><![CDATA[New comment by acaciabengo in "Ask HN: Who wants to be hired? (March 2026)"]]></title><description><![CDATA[
<p>Location: Kampala, Uganda<p>Remote: Yes<p>Willing to relocate: No<p>Technologies: Python, PyTorch, TensorFlow, OpenCV, Detectron2, Ruby on Rails, Docker, GCP, SQL.<p>Resume: <a href="https://drive.google.com/file/d/1G8Rzgb7a2kS8myjnJqhxdALUA2dalOSz/view" rel="nofollow">https://drive.google.com/file/d/1G8Rzgb7a2kS8myjnJqhxdALUA2d...</a><p>Email: acaciabengo@gmail.com<p>Summary: Machine Learning Engineer & Software Engineer with 11+ years of experience, MSCS at Georgia Tech (Machine Learning).<p>Recent work includes a 2D→3D architectural reconstruction pipeline (Detectron2 + Swin Transformers), large-scale predictive modelling for gaming platforms (risk, churn, LTV), and NLP systems processing 1M+ messages/month for a US health-tech company.</p>
]]></description><pubDate>Wed, 04 Mar 2026 19:41:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=47252741</link><dc:creator>acaciabengo</dc:creator><comments>https://news.ycombinator.com/item?id=47252741</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47252741</guid></item><item><title><![CDATA[New comment by acaciabengo in "Show HN: Emotional photoreal AI humans at $0.06 / min"]]></title><description><![CDATA[
<p>This is great and exciting. I happened to be doing some research  to build memory-efficient diffusion models. I have not yet built the demo, but looking at a mix of architecture from several papers, IMTalker, SageAttension, FlashVSR, and Sparse VideoGen, with the intention to reduce memory to about 8GB.<p>The plan was to swap FlashAttention out, and also for an audio driver; SVG could have improved. At 60FPS, I think you are already doing this.<p>Great work.</p>
]]></description><pubDate>Thu, 26 Feb 2026 20:09:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=47171374</link><dc:creator>acaciabengo</dc:creator><comments>https://news.ycombinator.com/item?id=47171374</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47171374</guid></item><item><title><![CDATA[Show HN: Turning 2D floor plans into 3D-ready JSON with Detectron2]]></title><description><![CDATA[
<p>Hey HN,<p>For the past few weekends, I have been working on a computer vision pipeline to solve a specific PropTech problem: turning messy, highly occluded 2D floor plans into clean, structured data for 3D extrusion. I originally built this as a demo for a firm.<p>The Stack & Architecture I built an instance segmentation pipeline that relies strictly on pixel-perfect masking to extract the geometry.<p>The Backbone: Swin Transformer + Detectron2 + OpenCV
Training: Trained on 1024x1024 images using an RTX 4090.
Inference: Runs on CPU in < 10s.
Demo Performance: 67.1% AP50 for instance segmentation masks, and 38.2% AP across the strict 0.50:0.95 IoU thresholds.<p>Why I'm posting:  A lot of virtual staging and architectural startups have beautiful Three.js rendering engines, but still rely on manual data entry to build the base geometry. I built this specifically as an extraction engine to sit underneath those UIs.<p>If you are in PropTech or building a product that could benefit from embedding this model under the hood, I would love to chat.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47170196">https://news.ycombinator.com/item?id=47170196</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 26 Feb 2026 18:39:58 +0000</pubDate><link>https://www.loom.com/share/b741bc127c814db395832b01bd086e96</link><dc:creator>acaciabengo</dc:creator><comments>https://news.ycombinator.com/item?id=47170196</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47170196</guid></item><item><title><![CDATA[Instance segmentation model that extracts 3D geometry from 2D floor plans]]></title><description><![CDATA[
<p>Hey HN,<p>I am an ML Engineer and a full-stack software engineer. For the past few weekends, I have been working on a pipeline to solve a PropTech problem: turning messy, highly occluded 2D floor plans into clean, structured data for 3D extrusion. Originally demoed for a firm hiring for the role.<p>The Problem:
If you try to use standard object detection (bounding boxes) or basic OCR (tested Qwen, DeepSeek) on architectural plans, it fails instantly. Walls intersect, doors swings and dimension lines heavily occlude the actual structures.<p>The Stack & Architecture:
I built an instance segmentation pipeline that relies strictly on pixel-perfect masking to pull the geometry.<p>The Backbone: Swin Transformer + Detectron2.<p>Model trained on 1024X1024 images, with RTX 4090<p>Inference: Inference on CPU <10s.<p>Output:  Clean JSON<p>Demo Performance:
67.1% AP50 for instance segmentation masks, and a 38.2% AP across the strict 0.50:0.95 IoU thresholds.<p>Vector Clean-up (The JSON Payload): 3D engines don't want pixel masks; they want math. The pipeline passes the raw predictions through Shapely to run boolean unions on intersecting walls, outputting clean, mathematically sound 2D polygons in a structured JSON payload.<p>Why am I posting:
A lot of virtual staging and architectural startups have beautiful Three.js rendering engines, but they still rely on human data entry to get the base data. I built this specifically as an extraction engine to sit underneath those UIs.<p>If you are in the PropTech or you are building a product that could benefit from embedding this model, I would love to chat.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47092547">https://news.ycombinator.com/item?id=47092547</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 20 Feb 2026 19:17:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=47092547</link><dc:creator>acaciabengo</dc:creator><comments>https://news.ycombinator.com/item?id=47092547</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47092547</guid></item><item><title><![CDATA[New comment by acaciabengo in "Ask HN: Who wants to be hired? (February 2026)"]]></title><description><![CDATA[
<p>Location: Kampala, Uganda<p>Remote: Yes<p>Willing to relocate:No<p>Technologies: Python, PyTorch, TensorFlow, OpenCV, Detectron2, Ruby on Rails, NLP (Transformers, ViT), Docker, GCP, SQL.<p>Resume: <a href="https://drive.google.com/file/d/1G8Rzgb7a2kS8myjnJqhxdALUA2dalOSz/view?usp=sharing" rel="nofollow">https://drive.google.com/file/d/1G8Rzgb7a2kS8myjnJqhxdALUA2d...</a><p>Email: acaciabengo@gmail.com<p>Linkedin: <a href="https://linkedin.com/in/acaciabengo" rel="nofollow">https://linkedin.com/in/acaciabengo</a><p>HuggingFace: <a href="https://huggingface.co/acaciabengo" rel="nofollow">https://huggingface.co/acaciabengo</a><p>Description: I am a Senior Machine Learning & Software Engineer with 11+ years of experience and a current MSCS student at Georgia Tech. I have worked remotely for US-based companies for the last 3+ years and specialize in bridging the gap between robust backend engineering (Ruby on Rails) and production-grade ML models.<p>Key Projects & Experience:
• Computer Vision (2D to 3D): Currently building a pipeline to convert 2D architectural floor plans into 3D models using Image Segmentation (Detectron2) and Swin Transformers.<p>• ML for Gaming: Engineered predictive algorithms for high-volume sports betting and lottery platforms, including models for risk management, user segmentation, Churn Prediction and LTV forecasting.<p>• NLP at Scale: Architected Deep Learning models for a US-based health tech organization that reduced moderation time by 80% and processed over 1 million messages monthly.<p>• Content Moderation: Developed CensorX, a multimodal NSFW detection tool using Vision Transformers (ViT) and DistilBERT.
Note on Hiring: I am hireable through a Canadian company (B2B/Contract) or via an Employer of Record (e.g., Globalization Partners), allowing for frictionless onboarding for North American entities.</p>
]]></description><pubDate>Tue, 03 Feb 2026 11:11:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=46869446</link><dc:creator>acaciabengo</dc:creator><comments>https://news.ycombinator.com/item?id=46869446</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46869446</guid></item><item><title><![CDATA[New comment by acaciabengo in "Open source NSFW detection (ViT and DistilBERT) with 99% AUC"]]></title><description><![CDATA[
<p>I have been working on CensorX, a multimodal content moderation set of models. It is from a personal project where I built content moderation in a Discord Bot.<p>I have open-sourced the fine-tuned models on Hugging Face and am looking for feedback on false positives/negatives in real-world scenarios.<p>The main exploration has been ablations based on freezing certain layers of the transformers. More work could be explored by tuning other parameters and expanding the datasets.<p>The Models:
• Image (ViT-B/16): Fine-tuned Vision Transformer achieving 91.9% Accuracy and 0.99 AUC.
o Link: <a href="https://huggingface.co/acaciabengo/nsfw_image_detection" rel="nofollow">https://huggingface.co/acaciabengo/nsfw_image_detection</a>
• Text (DistilBERT): Binary classifier trained on ~200k samples.
o Focus: Optimized for low-latency inference (<100ms) to fit into real-time chat streams.
o Link: <a href="https://huggingface.co/acaciabengo/nsfw_text_detection" rel="nofollow">https://huggingface.co/acaciabengo/nsfw_text_detection</a>
How to try it:
1. Self-Host (Free): You can pull the weights directly from Hugging Face and run them in your own container.
2. Managed API (Freemium): I have deployed these exact models as a high-availability API on RapidAPI. There is a free tier for testing. RapidAPI
I am very interested in feedback on:
• Performance
• Access to larger datasets
• Shared experience from people who have handled similar tasks
Thank You</p>
]]></description><pubDate>Sun, 18 Jan 2026 15:18:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=46668475</link><dc:creator>acaciabengo</dc:creator><comments>https://news.ycombinator.com/item?id=46668475</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46668475</guid></item><item><title><![CDATA[Open source NSFW detection (ViT and DistilBERT) with 99% AUC]]></title><description><![CDATA[
<p>Article URL: <a href="https://huggingface.co/acaciabengo">https://huggingface.co/acaciabengo</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46668474">https://news.ycombinator.com/item?id=46668474</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Sun, 18 Jan 2026 15:18:02 +0000</pubDate><link>https://huggingface.co/acaciabengo</link><dc:creator>acaciabengo</dc:creator><comments>https://news.ycombinator.com/item?id=46668474</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46668474</guid></item></channel></rss>