Cloud Cost Optimization and Management – Meeting Notes
Overview
- Discussion centers on secure access, environment management, and cost optimization across production and non-production (non-prod) environments. Mention of multiple environments (NaviC, SSH, RDP) and access flow: authenticate to network, then to Jambox, then to type server for NaviC/Jambox access.
- Production environment has a large server footprint; estimated migration scope is >100 servers, mostly VMs, with past services still in use.
- Emphasis on making management traffic come from predictable IPs to enable access lists and tighter control.
- Historical context: James Broktar (left) had begun alignment on several items; action points needed to confirm what was flagged, particularly OS versions.
- Several cost optimization initiatives discussed: decommissioning, rightsizing, PaaS (Azure) optimization, storage cost optimization, and firewall/IDS/IPS POC with associated networking changes.
- Regional coordination required: needs engagement with regional IT teams and TCS (CloudOps) to implement recommendations and refresh reports.
- The conversation references a framework for budgeting and forecasting (PXC/PwC reporting), with a focus on aligning reported potential savings with actual actionable items in regional contexts.
- Roundtable closes with a push to finalize RI (Reserved Instances) planning, including a move from 1-year to 3-year reservations, to improve savings, and to finalize a model showing regional consumption and reservations.
Key concepts and terms (definitions and context)
- IES / IAS: Centrally managed vs. self-managed environments; central management discussed in PWC report figures. The team references IES (likely a specific environment) and the central management model for Azure/PaaS and databases.
- EBAT data platform: A large, heavy-user data platform likely sitting in centrally managed environments; suspected as a major driver of 105k Azure PaaS savings if right-sized or terminated.
- Samurai: A large environment that’s historically centralized; referenced as a likely high-cost, centralized workload.
- Data center licenses vs. Standard: Efforts to remove Enterprise/Datacenter licenses from Windows servers; SQL licenses remain the remaining area to convert to standard licensing.
- PaaS (Azure PaaS): Platform as a Service workloads in Azure, including Azure SQL and other Azure PaaS services; targeted for optimization with potential savings stated at 105,000.
- PWC report: A cost-optimization report that estimates savings in centrally managed environments and Azure PaaS workloads; some optimism vs. reality caveats noted.
- POC (Proof of Concept): IDS/IPS firewall PoC in the Americas; potential production rollout pending approvals and possibly requiring firewall upgrades.
- PXQ model: A cost-tracking or planning model for regional spend and cost optimization; emphasizes reflecting regional changes in future budgets.
- Rightsizing: Adjusting instance sizes to better-fit utilization; a major focus area for both compute and database workloads.
- Auto-off: Automation to power off non-prod or underutilized instances to reduce cost.
- Prism: A planning/optimization approach or tool referenced for reservation planning and cost optimization.
- Reservations: Move from shorter to longer-term commitments (e.g., from 12 months to 36 months) to lock in savings.
- TCS CloudOps: The outsourcing/partner team responsible for cloud operations and optimization activities.
- APAC/EMEA: Regions to be engaged for cost-optimization validation and budgeting.
Major points by topic
1) Security, access, and management posture
- Authentication flow: Network authentication -> Jambox -> type server -> NaviC environment or SSH/RDP access.
- Goal: Ensure management traffic originates from predictable IP addresses to enable strict access-lists.
- Production vs non-prod: Need consistent access-control measures; production requires stricter controls.
- OS version flags: Items flagged for OS versions require action; responsibility lies with the team after James Broktar’s departure.
2) Migration and production environment size
- Production environment has many servers; >100 servers to migrate.
- Mix of VMs and possibly past-services migrating to or being retained in the cloud.
- Emphasis on understanding service roles and management traffic to inform firewall rules and access controls.
3) Cost optimization: decommissioning and rightsizing
- Decommissioning variance: A cost-saving initiative; Steve acknowledged cost savings. Target savings reported: 2.8extk per month, or 35extk per year.
- 2.8extkextpermonth<br/>ightarrow2.8 ext{k} imes 12 = 33.6 ext{k}peryear,approximatingthereported35 ext{k}.</li></ul></li><li>Actionplan:HydratenumberswithregionalITteamsandwithTCS;requestTCStobeginimplementingtherecommendations(e.g.,Henrik’sflaggeditems).</li><li>PwC(PWC)reportalignment:Potentialsavingsof195{,}000incentrallymanagedenvironments;anotherlineshows105{,}000potentialsavingsforAzurePaaS,thoughthelatterassumessignificantterminationsandconsolidations.</li><li>Realitycheck:Notallsavingsmayberealized;needtoreviewwhatportioncomesfromEBATdataplatformorSamuraienvironments,astheyarelarge,centralizedworkloads.</li><li>AzureADPlistandDBs:AdamhaspulledAzureADPlists;DBSlistreferenced.Theteamwillrefreshreportstoidentifylarge,quick−winitemsforAzurePaaSanddata−platformworkloads.</li><li>Data−platformlicensing:Considermovingclustered/multi−DBSQLenvironmentsfromdatacenterlicensestostandardlicenses;SQLlicensingremainsthemainbarrierforcostsavings.</li><li>Nextsteps:AdamtorefreshAzurePaaSreport;identifylargestenvironmentsforquickwins;engageapplication/technicalownerstoconfirmrequirementsorcutifpossible.</li><li>Bob(Bobbry)involvement:Bobisrunningin−housetoolingtoanalyzedatabaseperformanceandwilladviseonright−sizingthedatabaselayer.</li><li>Governanceonhowtorespond:ForeachlineinthePwC/PaaSdata,regionsshouldconfirmYes/No/NeedsClarification;thenupdatethebudgetingsheets.</li></ul><h5id="4paasoptimizationanddataplatforms">4)PaaSoptimizationanddataplatforms</h5><ul><li>EBATdataplatformandSamuraienvironmentarepotentialmajorcontributorstoAzurePaaSspend.</li><li>Hypothesis:The105{,}000AzurePaaSsavingsmayprimarilycomefromterminatingorconsolidatinglargeEBAT/Samuraiworkloads.</li><li>Action:VerifytheAzurePaaSlist;confirmwhichworkloadsarecentralized(EBAT,Samurai)andwhichcanbeterminatedorright−sized.</li><li>Data−centerlicensingcontext:Somedata−centerworkloadsmigratedtocloudwhilelicensingremained;focusonmovingtostandardinstancesforWindowsandSQL.</li><li>Nextsteps:RefreshreportonAzurePaaSinstances;findquickwins;contactownersforvalidation;ifaworkloadmustremain,justifycapacityandperformanceneeds.</li></ul><h5id="5networkingidsipsandfirewallpoc">5)Networking,IDS/IPS,andfirewallPoC</h5><ul><li>IDS/IPSPoCinAmericas:POCrundone;awaitingapprovalfromJamesfromAmericatoproceed(likelyasAmericas−firstdeployment).</li><li>Firewallchanges:IfPoCproceeds,somefirewallupgradesmayberequired;clarifywhethernewfirewallhardwareisneededorifupgradessuffice.</li><li>PXQmodel:Anyfirewall/networkchangesshouldbereflectedinthePXQbudgetingmodel;snapshotsofregionalnetworksweretakeninJuly;Americasmayincurmajorchanges.</li><li>Budgetimpact:IfthePoCgoesproductionintheAmericas,budgetsshouldreflectincreasedcostsinthatregion;costsmaybechargeddirectlytosub−subscriptionsineachregion.</li><li>Action:Determinewhatisrequiredfromtheteam’ssidefordeploymentandupgrade,andalignwithregionalbudgets.</li></ul><h5id="6storageoptimizationandbackups">6)Storageoptimizationandbackups</h5><ul><li>StoragecostoptimizationongoingwiththeCloudOpsteam(TCS):trimmingredemptionpoliciesandcuttingbackupsinnon−prodenvironmentswherefeasible.</li><li>Observedtrend:StoragesavingsreportedforJulyandAugust;theteamistrackingadecreaseinreportedstoragesavingsduetoorganicgrowthandnewbackups.</li><li>Backupgrowth:Backupsaddincrementalcostsannually;asarchivesandbackupsmature,costsriseslightly;expectayearlyincreasetowardyear−end.</li><li>Auto−offroutine:Consideraddingautomaticpower−offfornon−prodorunusedresourcestobolstersavings.</li><li>Tractiontracking:Trackauto−offadoption,storagereductions,andrightsizingtomeasureimpact.</li><li>Datapointsnoted:InAugust,RI(ReservedInstances)adjustmentsshowadropofabout9permonthinthereports;amoredetailedmodelingisneededtoreconcilethiswithsubscription−levelchanges.</li><li>Regionalcollaboration:Trackbackwithregionalteamstoensurebackupsaretrimmedconsistently;confirmwhethersomeworkloadscanbeentirelyremovedfrombackups.</li><li>Prismandlong−termreservations:Discussmovingfromyear−longreservationsto3−yearreservationstoimprovesavings;usePrismorasimilarmodeltoplanoptimalreservationmix.</li></ul><h5id="7reservationsriplanningandregionalbudgeting">7)Reservations,RIplanning,andregionalbudgeting</h5><ul><li>RIrefreshwork:Apersonisbuildingahigh−levelbudgetviewbyregion/subscriptiontodeterminewhethertokeep,resize,orterminatereservations.Thegoalistoshowaclearpathto26<li>Proposedshift:Movefromone−yearcommitmentstothree−yearcommitmentstoimprovesavingsandalignwithlong−termcapacityplanning.</li><li>Challenges:Complexscriptlogictomapreservationsizestoactualworkloads;needtobalancerisk(underutilizedreservations)withpotentialsavings.</li><li>Timeline:Beginwithaquick,regional,high−levelviewtoguidebudgetsfor2020/2021planning;implementearlierratherthanlatertomanagerisk.</li><li>Nextsteps:Shareinitialviewswithcolleaguestoreviewreservationsizingandupdatethebudgetsheetsaccordingly.</li></ul><h5id="8regionalcoordinationandresponsibilities">8)Regionalcoordinationandresponsibilities</h5><ul><li>Regionsinvolved:APAC,EMEA;needtocoordinatewithregionalITteamsandTCSforactioningrecommendationsandreporting.</li><li>Communicationcadence:Weeklycallswithregionalteams;tracktractiononauto−off,storage,andrightsizing.</li><li>Ownership:Identifyapplicationowners/technicalownersforAzurePaaSworkloads;obtainconfirmationonrequirementsorpermissiontoterminate/scaledown.</li></ul><h5id="9roundtablecurrentstatusandongoingconcerns">9)Roundtable:currentstatusandongoingconcerns</h5><ul><li>RIworkinprogress:OneparticipantisclosetofinalizingaviewofRIallocationsbyfamilysize,region,andsubscription;aimingtounderstand20−30<li>Forecastingadjustments:Expectadropinsavingswhenworkloadsmovetopower−offoraredecommissioned;alsoexpectsavingstoadjustasnewworkloadsareintroducedorre−sized.</li><li>Thescriptchallenge:BalancingreservationsacrossmultipleD−seriesorcomputefamilies;theprocessiscomplexduetohowreservationsareprovisionedversushowcomputeisconsumedacrossregions.</li><li>Unifiedcomputeacrossthreeregions:Acknowledgethatcomputeisnowunifiedacrossregions,enablingbettercostefficiencyandeasierrightsizing.</li><li>Earlyactions:RequestearlyviewofRIplanningtobeginoptimizingreservationssoonerratherthanlater.</li></ul><h4id="numericalreferencesandformulaslatex">Numericalreferencesandformulas(LaTeX)</h4><ul><li>Monthlydecommissioningsavings:2.8 ext{k} ext{ per month}</li><li>Annualizeddecommissioningsavings:2.8 ext{k} imes 12 = 33.6 ext{k} ext{ per year} ext{(approximately }35 ext{k} ext{ per year reported)}</li><li>Centralizedsavings(PwC):195{,}000</li><li>AzurePaaSsavings(PwC):105{,}000</li><li>Targetreservationsavingsreference:26 ext{ ext{%}}(ascitedinplanningdiscussions)</li><li>Reservationhorizonproposal:12 ext{ months}
ightarrow 36 ext{ months}$$ - Storage/funding notes: Regional budgets will reflect costs directly charged to sub-subscriptions in each region; alignment across APAC/EMEA is needed
Action items (summary)
- Clarify OS-version flags and ownership for items flagged after James Broktar’s departure; assign responsible owner.
- Update Azure PaaS inventory; Adam to refresh Azure PaaS report; identify EBAT/Samurai workloads for potential right-sizing or termination.
- Engage with Paprika/Merkel contact (database team) to verify DB license status and potential consolidation to standard licenses; confirm DBA (DBS) list and responsibilities.
- Validate POC roadmap for IDS/IPS firewall in Americas; determine deployment scope and required firewall upgrades or replacements.
- Hydrate PwC regional figures with regional IT and TCS inputs; update the PWC-based cost-savings estimates by region.
- Develop a regional RI plan: build an initial view of family sizes and 3-year reservation opportunities; map to budget lines and share for review.
- Implement storage optimization actions: extend auto-off routines; trim backups for non-prod environments where feasible; track monthly savings and update baselines.
- Align regional budgeting with PXQ model: reflect expected costs for changes in Americas; ensure regional teams budget for potential spikes.
- Schedule follow-up on license reallocation from Data Center to Standard for Windows servers; track SQL licensing separately.
- Maintain ongoing right-sizing efforts (compute and databases) with Bob (Bobbry) and regional owners; report progress in weekly calls.
- Prepare a consolidated, regional cost-optimization dashboard: RI status, right-sizing, decommissioning, and storage, with Yes/No/Needs Clarification for PwC lines.
Notes on sentiment and risk
- Optimism about potential savings exists but requires careful validation against actual workload footprints and regional approvals.
- Some initiatives (like POC deployment in Americas) depend on approvals and concrete changes to security infrastructure; delays could shift timelines and budgets.
- The team recognizes the complexity of reservations planning across multiple regions and compute families; a rigorous, data-driven approach is essential to avoid over-committing or under-committing resources.
References to people and roles (contextual)
- James Broktar: Previously involved in alignment; left the team; action items depend on what he flagged.
- James Procter: Mentioned as the correct James in some parts; not to be confused with James Norris.
- Henrik: Flagged some recommendations for action.
- Adam: Pulled Azure ADP lists; to refresh Azure PaaS report.
- Bob/Bobbry: Runs internal tooling to analyze database performance for right-sizing.
- Paprika: A contact from Merkel group referenced in discussions about data center and licensing.
- Callers/regions: APAC, EMEA, Americas; regional IT teams and TCS CloudOps to be engaged for execution and budgeting.
Summary takeaway
- The team is actively pursuing a multi-pronged cost-optimization program spanning decommissioning, rightsizing, storage optimization, and reservation planning, with a strong emphasis on regional alignment and governance. There is a clear pathway to significant savings, but realization depends on timely validation with regional owners, accurate inventory of large workloads (EBAT/Samurai), and concrete changes in regional budgets and subscriptions. The next steps are to refresh key reports, confirm ownership, implement quick wins, and finalize RI planning to enable more aggressive long-term savings.