Phase 1 — Months 1 to 3: The pilot
The pilot phase is where you prove the concept with minimal risk and maximum learning. Get this right and you build momentum. Get it wrong and you set AI adoption back by years in your organisation.
Month 1: Setup and baseline.
Week 1-2: Select your pilot project type. The ideal pilot has:
- Standardised drawing formats (consistent CAD standards, clear annotations, regular layouts)
- Repetitive building elements (residential new-build, hotel fit-out, or office fit-out are ideal)
- A live tender or recent completed tender where you have both the drawings and the measured quantities
- A QS team willing to participate (volunteers, not conscripts)
The worst pilot choices are: complex refurbishment (ambiguous drawings, unknowns), heavy civil engineering (bespoke structures), or one-off bespoke buildings (unique elements that do not recur).
Week 3-4: Establish baselines. Take a recent completed tender and record:
- Total hours spent on quantity takeoff
- Hours by work section (how long did substructure take versus finishes versus services?)
- Known errors discovered post-tender (items that were wrong in the estimate)
- The tender timeline from drawing receipt to tender submission
These baseline numbers are your "before" measurement. Without them, you cannot prove improvement.
You need to select a pilot project type for AI-assisted estimation. Your firm does residential new-build, commercial office fit-out, hospital refurbishment, and infrastructure (roads and bridges). Which is the best pilot candidate?
Month 2-3: Run AI and manual in parallel
Month 2: Parallel processing.
Select 3-5 live tenders of the pilot project type. For each tender, produce the quantity takeoff twice:
- Once using your established manual process (or digital takeoff tool)
- Once using the AI-assisted pipeline
The parallel run is essential. It builds the comparison data you need to validate accuracy, and it builds the QS team's understanding of what the AI produces and where it needs correction.
For each parallel run, record:
- Time taken for AI-assisted takeoff (including QS review and adjustment)
- Time taken for manual takeoff
- Quantity differences between AI and manual for each work section
- Where the AI was right and the manual was wrong (this will happen — AI catches things humans miss)
- Where the AI was wrong and needed correction
Month 3: Analysis and decision.
Compile the parallel run data. You should now have clear answers to:
- How much time does AI save on first-pass measurement? (Target: 40-60%)
- How accurate is the AI first pass compared to final reviewed quantities? (Target: 85-95%)
- Where does AI perform well? (Typically: repetitive floor areas, door/window counts, linear measurements)
- Where does AI struggle? (Typically: complex shapes, ambiguous drawings, items requiring specification cross-reference)
- What is the QS team's assessment of the tool's usefulness?
The decision point. At the end of Month 3, you present the pilot results to the business. If AI saves meaningful time with acceptable accuracy, you proceed to Phase 2. If it does not, you understand why and either adjust the approach or conclude that the technology is not yet ready for your project types.
This is not a failure condition — it is a responsible evaluation. A pilot that concludes "not yet for our project types" is more valuable than a premature rollout that produces bad quantities.
After the pilot, your QS team reports: 'The AI saves time on measurement but the item descriptions are not NRM 2 compliant — we spend too long fixing them.' What is the correct response?
Phase 2 — Months 4 to 6: Expand to additional project types
If the pilot succeeds, Phase 2 expands the scope methodically.
Month 4: Additional project types.
Take the lessons from the residential pilot and apply them to your next most common project type — typically commercial fit-out or education new-build. Expect that each project type needs some configuration adjustment:
- Different drawing conventions (different architects have different drafting standards)
- Different specification structures
- Different work section emphasis (fit-out is heavy on finishes; new-build is heavy on substructure and frame)
Run another parallel validation — but this time, you only need 2-3 projects because your team now knows what to look for.
Month 5: Build the rate library.
If you have not already, this is the month to structure your rate library for AI consumption. This means:
- Extracting rates from your last 2-3 years of priced tenders
- Structuring them by NRM 2 work section with consistent description formats
- Adding metadata: project type, region, date, market conditions
- Connecting the rate library to the AI system so it can match measured items to rates automatically
This is a significant effort — expect 2-4 weeks of a senior QS's time to structure the initial library, plus ongoing maintenance.
Month 6: Process refinement.
By now, you should have AI-assisted takeoff running on two or three project types with a connected rate library. Month 6 is about smoothing the workflow:
- Standardising the QS review process (what to check, in what order, to what tolerance)
- Building templates for the AI extraction prompts (so each new project does not start from a blank prompt)
- Documenting the process (so it is not dependent on one person's knowledge)
- Measuring cumulative time savings and accuracy across all projects processed to date
During Phase 2, you discover that AI performs well on new-build projects but poorly on refurbishment projects. Refurbishment drawings show existing and new work on the same sheet, and the AI cannot reliably distinguish between them. What do you do?
Phase 3 — Months 7 to 12: Integrate with existing workflows
Phase 3 is about making AI-assisted estimation the default workflow, not an experiment.
Integration with existing tools. Your QS team uses specific software — CostX, Buildsoft Cubit, Bluebeam, or Excel. AI-assisted takeoff must feed into these tools, not replace them. This means:
- Exporting AI-generated quantities in formats your existing tools can import (CSV, Excel, CostX import format)
- Connecting the rate library so AI-matched rates flow into the pricing spreadsheet or estimating software
- Ensuring the BOQ output format matches your firm's standard templates
Process standardisation. Document the complete workflow:
- Drawing receipt and classification
- AI processing (with standard extraction prompts per project type)
- Automated validation checks
- QS review protocol (what to check, flagged items, spot checks)
- Description enrichment and specification matching
- Rate matching and pricing
- Final compilation and review
- Sign-off
This workflow document is not bureaucracy — it is the basis for consistent quality. When a new QS joins the team, this is their onboarding guide.
Change management. By Month 7-12, you will encounter the full spectrum of adoption attitudes:
- Enthusiasts — already pushing the boundaries of what AI can do. Channel their energy into developing new use cases and refining prompts.
- Pragmatists — using AI where it clearly saves time, reverting to manual where it does not. This is the right attitude. Support them.
- Sceptics — still unconvinced. Do not argue philosophy. Show them the time savings data from their own project types. Let them see a colleague produce a first-pass takeoff in two days instead of two weeks.
- Resistors — actively opposed. Usually driven by fear (of job loss, of being deskilled, of looking incompetent). Address the fear directly: AI changes the QS role from measurement to review and management, which is a promotion in every meaningful sense.
The technology stack — what to buy vs what to build. By this phase, you need clarity:
- Buy if a commercial tool exists that handles your project types and integrates with your software. Most mid-size contractors should buy, not build.
- Build only if your project types are so specialised that no commercial tool handles them, AND you have internal development capability. Building custom AI pipelines is expensive and requires ongoing maintenance.
- Hybrid — use commercial tools for the AI processing, build the integration layer (export formats, rate library connection, validation checks) in-house or with a specialist integrator.
Measuring success — KPIs for construction AI adoption
You cannot manage what you do not measure. Define KPIs before you start, track them consistently, and report them to the business.
Efficiency KPIs:
- Time per tender takeoff — hours from drawing receipt to complete first-pass quantities. Track the trend over time.
- QS review time — hours spent reviewing and adjusting AI output. This should decrease as the AI is refined.
- Tenders per estimator per year — the capacity metric. If each estimator can handle more tenders, the team's output increases.
Quality KPIs:
- First-pass accuracy — percentage of AI-generated quantities within 5% of final reviewed quantities. Track by work section to identify where AI is strong and where it needs improvement.
- Discrepancies caught — number of drawing/schedule/specification inconsistencies flagged by AI per tender. This is a quality metric — more catches is better.
- Post-tender estimation variance — the difference between tender estimate and final account measured work. This should decrease as estimation accuracy improves.
Adoption KPIs:
- Percentage of tenders using AI-assisted takeoff — track adoption across the team. The target is not 100% (some project types may not be suitable) but it should be increasing.
- User satisfaction — regular (quarterly) feedback from the QS team on what is working and what is not.
- Training completion — ensure every estimator and QS has completed the AI-assisted workflow training.
Commercial KPIs:
- Tender hit rate — are you winning more work? This is influenced by many factors beyond estimation, but a sustained improvement supports the AI investment case.
- Project margin performance — are projects priced with AI-assisted estimation achieving target margins? This takes 12-24 months to measure fully but is the ultimate test.
Report these KPIs quarterly to the business. Be honest about what is working and what is not. Transparent reporting builds credibility for continued investment.
After 9 months of AI adoption, your first-pass accuracy KPI shows 88% for residential projects but only 72% for commercial office projects. What action do you take?
Key takeaways
- Start with the right pilot: standardised drawings, repetitive elements, willing team. Residential new-build is often ideal.
- Run AI and manual in parallel during the pilot to build comparison data and team confidence.
- Expand methodically — each project type needs configuration adjustment, but the second expansion is faster than the first.
- Integrate with existing tools (CostX, Buildsoft, Excel) rather than replacing them.
- Change management is as important as technology — address fears directly, show results, and channel enthusiasm productively.
- Measure and report KPIs — time per tender, first-pass accuracy, discrepancies caught, and tender hit rate.
Next up: Capstone: Your Construction AI Blueprint.
Module 11 — Final Assessment
Why should the first AI estimation pilot use residential new-build rather than complex refurbishment?
What is the primary purpose of running AI and manual takeoff in parallel during the pilot?
Which KPI is the ultimate measure of AI estimation success, though it takes 12-24 months to assess fully?
Your QS team reports that AI saves time on measurement but descriptions are not NRM 2 compliant. Is this a reason to abandon AI adoption?