2024 F1 Bahrain Grand Prix Post-Race Report

Position Changes Over The Race Distance

Formula 1 Drivers Position Progression over the race distance graph

Max Verstappen started from pole and converted it to a dominant win leading all 57 laps of the F1 2024 Bahrain Grand Prix. The most impressive performance came from Lance Stroll with him surging from 20th to 10th place after having a first lap spin to secure a double points finish for Aston Martin. The Alpines starting in last were not able to improve much finishing in 17th and 18ths. It looks like it may be a long season for the French outfit.

Current Standings

Formula 1 Drivers Championship Standings Following the 2024 F1 Bahrain Grand Prix
Formula 1 Constructor Championship Standings Following the 2024 F1 Bahrain Grand Prix

With a one-two finish, Red Bull Racing secures the top spot in F1 Constructor Championship standings at 44 points. Max Verstappen continues his dominant form into the 2024 season securing both the race victory as well as the fastest lap to give him 26 points on the weekend. Sergio Perez and Carlos Sainz round out the podium with Charles Leclerc finishing just off the podium. With Ferrari finish 3-4 in the race, they secure second with Mercedes behind for third. Of the ten teams, only 5 teams scored points. The most shocking performance of the weekend has to be Alpine with both cars starting in last and second-to-last and finishing far outside the points. This is a steep drop in performance compared to last year where they secured 6th in Championship with a gap to 7th being 92 points.

Lap Time Analysis

Interpreting The Data

In the comprehensive analysis of lap times for the first round of F1 in Bahrain, a sophisticated multiple linear regression model was meticulously crafted, encapsulating two pivotal independent variables: fuel load, which methodically decreases from a full state (1) to empty (0) over the race distance, and tyre age, quantified by the laps since the tyre was new. The focal point of this model, the dependent variable, is the lap time, a critical measure of performance in the high-stakes environment of Formula 1 racing.

The model is structured as follows:

  • Beta-0: The Intercept serves as the foundational lap time prediction, established when both independent variables—fuel load and tyre age—are at their baseline values, embodying a scenario with a brand new tyre equipped on a vehicle at full fuel capacity.
  • Beta-1, a meticulously calculated coefficient, delineates the impact on lap time for each unit alteration in fuel load, with all other variables maintained constant. This coefficient captures the dynamic influence of reducing fuel weight on vehicle performance.
  • Beta-2 similarly quantifies the variation in lap time per unit change in tyre age, isolating the effects of wear and degradation on tyre performance over time, while holding fuel load steady.

The precision and reliability of this model are evaluated through two paramount metrics:

  • Mean Squared Error (MSE): This indicator computes the average of the squared discrepancies between actual and model-predicted lap times. A lower MSE denotes a model of higher fidelity, capable of closely mirroring real-world outcomes.
  • R² (Coefficient of Determination): R² articulates the proportion of variance in lap times that the model’s independent variables elucidate. An R² value at zero signifies that the model does not enhance our understanding beyond the baseline mean lap time prediction. Conversely, elevated R² values herald a model that adeptly captures the intricacies of lap time variance. Negative R² values, though less common, suggest a misalignment between the model’s assumptions and the empirical data, potentially indicating the need for model refinement or reconsideration of underlying assumptions.

To augment the robustness of this model and ensure its analytical precision, a rigorous data curation process was implemented, meticulously identifying and excluding outliers from the dataset. Outliers, or data points significantly deviating from the general dataset pattern, can skew results, misleadingly influence model parameters, and obscure the true relationship between variables. Their exclusion is a critical step in refining the model’s accuracy and reliability. This process involves sophisticated statistical techniques to detect anomalous data points that represent extraordinary circumstances, such as mechanical failures, racing incidents, or atypical weather conditions, which do not reflect the normal racing dynamics the model seeks to capture.

The inclusion of tyre compounds—hard, medium, and soft—further sophisticates the model, introducing a categorical variable that accounts for the diverse performance characteristics inherent to each tyre type. This addition allows for a nuanced analysis of how different compounds affect lap times, influenced by their varying levels of grip and durability. By integrating interaction terms between tyre compound and tyre age, the model adeptly captures the compound-specific degradation rates and their distinct impacts on performance.

This refined interpretation, bolstered by the exclusion of outliers and the consideration of tyre compounds, presents a comprehensive and nuanced framework for predicting F1 lap times. It underscores the complexity of the sport’s dynamics and the sophistication required in analytical models to accurately forecast outcomes. As the field of sports analytics continues to evolve, such models stand at the forefront, offering insights that drive strategic decisions and enhance our understanding of competitive motorsports.

Limitations of the Current Approach

The approach of using multiple linear regression to predict Formula 1 lap times based on fuel load and tyre age, while informative, has several limitations that could impact its accuracy and applicability. Understanding these limitations is crucial for interpreting the model’s results and considering potential areas for improvement:

  1. Linearity Assumption: The model assumes a linear relationship between the independent variables (fuel load and tyre age) and the dependent variable (lap time). However, the actual relationships may be more complex and non-linear, especially considering the dynamic and highly technical nature of F1 racing.
  2. Constant Variance (Homoscedasticity): Multiple linear regression assumes that the variance of error terms is constant across all levels of the independent variables. In the context of F1, factors such as changing track conditions, weather, and driver behavior could introduce variability that violates this assumption.
  3. Independence of Errors: The model presupposes that the errors of the prediction are uncorrelated with each other. In sequential data like lap times, adjacent laps may be correlated (e.g., due to a safety car period), which can violate this assumption and affect the model’s accuracy.
  4. Exclusion of Other Relevant Variables: The model only includes fuel load and tyre age as predictors. Other critical factors influencing lap time, such as driver skill, car setup, track temperature, weather conditions, and strategic decisions (like pit stops), are not accounted for. This omission could lead to an oversimplified model that fails to capture the complexity of F1 race dynamics.
  5. Impact of Outliers: F1 data can be prone to outliers due to crashes, mechanical failures, or unusual race incidents. These outliers can significantly affect the model’s performance and its predictive accuracy.
  6. Generalization Across Tracks and Conditions: The model’s applicability might be limited to specific conditions or tracks and may not generalize well across different circuits or racing conditions without adjustments or recalibration.
  7. Temporal Dynamics: The model does not account for the temporal progression within a race or across seasons. Changes in regulations, car performance improvements, and track alterations from one season to another are not considered, potentially limiting the model’s predictive power over time.
  8. Interaction Effects: The model might not adequately capture the interaction effects between variables, such as how the impact of tyre wear might change at different fuel levels. In reality, the effect of one variable could depend on the level of another, necessitating a more complex modeling approach to capture these interactions.

Addressing these limitations requires more sophisticated modeling techniques, such as non-linear models, time series analysis, or machine learning approaches that can handle complex interactions, non-linear relationships, and temporal dynamics. Additionally, incorporating a broader set of variables and accounting for the unique characteristics of each Formula 1 race could improve the model’s predictive accuracy and relevance.

Red Bull Racing

Max Verstappen

Max Verstappen lap time chart over the full race distance of the 2024 F1 Bahrain Grand Prix
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H1893.6533989090092-0.001255280.069040460.03686890605676087-0.6386180469672065
C3S3497.317868772595470.00085042-0.046773311.6477418863316247-0.13302487272311914

Sergio Perez

Sergio Perez lap time chart over the full race distance of the 2024 F1 Bahrain Grand Prix
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H2295.75924633084067-0.000382850.021056830.159675437960507980.002028512746832245
C3S3097.62857698512940.00097263-0.053494870.38867585079246790.7776524610117771

Ferrari

Charles Leclerc

Charles Leclerc lap time chart over the full race distance of the 2024 F1 Bahrain Grand Prix
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H4397.645221241006850.00082508-0.045379290.207761749403021170.7250212140254124
C3S997.15960957416675-0.002380170.130909110.2947448979591776-12.099773242630361

Carlos Sainz

Carlos Sainz lap time chart over the full race distance of the 2024 F1 Bahrain Grand Prix
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H4096.873883897026970.00063049-0.034676740.400986294531217260.3679033780788691
C3S1297.6406869514487-0.00017770.009773310.07243432893271697-0.3038179207889051

Mercedes AMG

Lewis Hamilton

Lewis Hamilton lap time chart over the full race distance of the 2024 F1 Bahrain Grand Prix
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H4297.629972308665170.00076068-0.041837630.363863205378559760.4542051919321607
C3S1098.272348838799376.26752057e-05-3.44713631e-030.029024970273486463-0.2899986788216449

George Russell

George Russell lap time chart over the full race distance of the 2024 F1 Bahrain Grand Prix
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H4397.56437146712970.00066695-0.036682140.3383579376829686-0.021886388975404625
C3S996.97376074025115-0.002250340.123768610.5188775510204018-0.22811254679385584

McLaren

Lando Norris

Lando Norris lap time chart over the full race distance of the 2024 F1 Bahrain Grand Prix
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H4197.327775366525060.00063543-0.034948630.425054311241215070.021336008796519756
C3S1197.54920573421154-0.000897250.049348480.0454871360766283-1.9241730334975689

Oscar Piastri

Oscar Piastri lap time chart over the full race distance of the 2024 F1 Bahrain Grand Prix
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H4297.412453008330670.0006251-0.034380670.33756376501106980.4775952432958228
C3S1098.01486627390206-0.000203690.011203190.0320396105826364-2.2039610582635487

Aston Martin

Fernando Alonso

Fernando Alonso lap time chart over the full race distance of the 2024 Bahrain Grand Prix
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H3999.066555734898020.00125914-0.069252650.41738741132863730.6652532039469572
C3S1398.42613052469991-0.00031270.017198620.071490345704708520.10637067869112649

Lance Stroll

Lance Stroll lap time chart over the full race distance of the 2024 Bahrain Grand Prix
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H4597.887780812910550.00055757-0.030666190.246188021726953780.04312717083092732
C3S1398.08207204230008-0.001999340.109963650.05105000000000491-1.2688888888891499

Sauber

Valterri Bottas

Valterri Bottas lap time chart over the full race distance of the 2024 Bahrain Grand Prix
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H42101.721613227294850.00192917-0.104175071.2310352559849802-2.077588139962458
C3S1098.59878742921991-0.001324580.071527190.022742642687281343-0.010784119434745332

Zhou Guanyu

Zhou Guanyu lap time chart over the full race distance of the 2024 Bahrain Grand Prix
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H4497.552971887734590.00020329-0.01097750.101413613493141290.443461877171787
C3S797.10595131984917-0.005738770.309893730.8586500000000041-37.16222222221951

Haas

Nico Hulkenberg

Nico Hulkenberg lap time chart over the full race distance
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H3697.761021165874610.0003058-0.01651310.240247589044417310.29208813541239964
C3S1593.86952071989874-0.000593180.03203169145.46106798443932-0.5799155365064259

Kevin Magnussen

Kevin Magnussen lap time chart over the full race distance
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H4298.262243304502990.00055116-0.029762460.67052897271856820.2586288999426155
C3S998.62953131886968-0.000925610.049982860.08760204081632081-0.40163265306113294

Racing Bulls

Daniel Ricciardo

Daniel Ricciardo lap time chart over the full race distance
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H2097.08212333309224-0.000634010.034236490.118241234777376790.3498763723580687
C3S3199.136428622136650.00116299-0.062801590.83960645436401660.4991390764081234

Yuki Tsunoda

Yuki Tsunoda lap time chart over the full race distance
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H3998.077912921147120.00045908-0.024790460.274124844566262070.45020400964460094
C3S1298.04637996679071-0.001222940.066038810.09183737242907303-40.32681759308756

Williams

Alex Albon

Alex Albon lap time chart over the full race distance
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H3899.05983185275990.0009851-0.053195270.257850924237139230.6467039359628166
C3S1398.31446611938667-0.000988640.053386710.065843739160595880.0

Logan Sargeant

Logan Sargeant lap time chart over the full race distance
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H26102.036849261787220.00282718-0.149840741.6461904330371864-0.9185126445237588
C3S2299.2658284053580.00119151-0.063150110.9408265437209240.619160239750274

Alpine

Pierre Gasly

Pierre Gasly lap time chart over the full race distance
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H2799.117876055010680.00109962-0.059379540.27994245286493720.18791875075442044
C3S2299.44270268805910.00141498-0.076409090.133883972318555370.9572310336319462

Esteban Ocon

Esteban Ocon lap time chart over the full race distance
CompoundTyreLapsBeta-0Beta-1Beta-2MSE
C1H4398.429266285436640.00044578-0.024072070.229108674371542560.37557864656478124
C3S898.56542063764141-0.00136990.073974630.0070599999999993510.8235000000000212

Conclusion

Reflecting on the Visualization and Analysis

The data visualization component of this Formula 1 analysis represents a significant triumph in transforming intricate quantitative data into an intuitive and engaging narrative for viewers. This success manifests in several key areas:

  • Intuitive Understanding: The visualizations bridge the gap between complex data and the viewer’s comprehension, proving invaluable for those unfamiliar with the sport’s statistical depths or for new fans.
  • Strategic Insights: They illuminate the nuanced strategies of the teams, detailing pit stop timings and tyre choices, and how these decisions unfold over the race distance.
  • Trend Identification: The progression of lap times enables identification of trends such as the impacts of fuel burn-off and tyre performance, offering a deeper insight into race outcomes.
  • Engagement and Narrative: The data tells a compelling story, enhancing viewer engagement by allowing fans to follow the race’s narrative and understand the strategic elements at play.
  • Performance Benchmarking: Comparative analysis across drivers and teams benchmarks performance, fostering transparency and appreciation of the competitive nature of the sport.
  • Predictive Elements: For the more analytically inclined, these visualizations lay the groundwork for predictive analysis, speculating on future performance trends.

Conversely, the data analysis component, especially the multiple linear regression model, acknowledges the necessity for substantial refinement. The inconclusive results from the current model indicate an invaluable opportunity for iteration and improvement. The modifications proposed are essential for developing a model that can offer precise and insightful analysis in future races.

This process exemplifies the iterative nature of design and analysis. It highlights that perfection is a journey involving continuous refinement and the application of feedback. Through this process, we aim to enhance the analysis’s accuracy and utility, laying the groundwork for sophisticated tools that can provide actionable insights into race strategy and tyre performance.

The aim to improve the regression model goes beyond academic exercise; it’s a stepping stone towards creating a robust predictive tool and simulation environment. This tool will be capable of forecasting the outcomes of race strategies, allowing for the evaluation of the practicality and effectiveness of various approaches in comparison with actual race data.

Formula 1’s complexity, with its myriad variables and conditions, requires a model that can accommodate and reflect its nuances. The current model’s limitations have been clearly highlighted, pointing to areas where it can be enhanced. Future iterations will progressively integrate factors such as track conditions, ambient temperatures, tyre degradation, DRS usage, traffic impact, engine performance, and the overarching strategies of drivers. This will ensure that the model evolves to mirror the sport’s multifaceted nature more accurately.

In conclusion, the development of this analytical model is a testament to the iterative design process. Each phase of refinement brings us closer to a predictive tool that can recount past races and anticipate future events on the track. This ongoing enhancement is pivotal for yielding deeper, more precise insights to inform strategic decision-making in the dynamic and fast-paced world of Formula 1 racing. Combined with the success of the data visualization, this evolving analysis is set to transform the understanding and enjoyment of the sport for teams, analysts, and fans alike.

Modifications to the Approach

To refine the multiple linear regression model used for predicting Formula 1 lap times, several modifications can be implemented to address the complexities of racing dynamics:

  1. Splitting the Model by Stints: Instead of using tyre compounds as the primary segmenting factor, the model could be split by race stints. Each stint is influenced by different variables such as tyre condition at the stint’s start, fuel load, and strategic objectives. This would allow for a more granular analysis of performance across different phases of the race.
  2. Incorporating Track and Ambient Temperature: Both track and ambient temperatures significantly affect tyre behavior and car performance. The model should include these as independent variables or as part of an interaction term with tyre compounds and age.
  3. Accounting for Track Evolution: As a race progresses, the track ‘rubbers in’, which can lead to faster lap times. The model can be modified to include a variable representing the race’s progress or specific lap number to capture this effect.
  4. Tyre Warmup and Operating Range: Tyres have an optimal temperature range for peak performance. The model could be adjusted to include a variable that captures whether tyres are within this operating window, with potential effects on lap times during the warmup period or if temperatures fall outside the optimal range.
  5. Degradation Factors Beyond Wear: Incidents like Logan Sargeant’s lockup can cause abnormal wear or flatspotting. The model could be enriched by including a degradation factor that accounts for such events, perhaps through a binary variable indicating significant wear events.
  6. Car Weight Instead of Fuel: Rather than just tracking fuel load, the model could use overall car weight, which would factor in both the decreasing weight from fuel burn and any additional weight that might affect performance, like ballast.
  7. Effect of DRS on Lap Times: The Drag Reduction System (DRS) allows a following car to reduce aerodynamic drag and potentially achieve faster lap times in certain sections. This could be included as a binary variable to indicate whether DRS was used on a given lap.
  8. Impact of Traffic: Being in traffic can slow down a car, especially if battling with other cars for position. The model can include a variable for the number of cars within a certain distance to the car in question to represent the potential impact of traffic.
  9. Engine Performance: Different engine manufacturers may perform differently. A categorical variable could represent the engine supplier, capturing any systematic differences in performance, such as the noted underperformance of the Renault engine.
  10. Driver Strategy: Driver behavior, such as pushing hard to overtake or conserving tyres and fuel, impacts lap times. This can be modeled by including variables for overtaking maneuvers, defensive driving, or periods of ‘lifting and coasting.’
  11. Non-Linearity in Variables: Some relationships (like tyre wear) may not be linear. Including polynomial terms or using non-linear modeling techniques could provide a better fit for such variables.
  12. Random Effects: Given that drivers and cars are unique, a random effects model could account for the unobserved heterogeneity between different drivers and cars.
  13. Time-Series Analysis: Using a time-series model could capture autocorrelation in lap times, where the performance on one lap is related to the previous lap(s).
  14. Qualifying Performance: The performance in qualifying could be a predictor of race pace and could be included to account for inherent car and driver speed.

By incorporating these modifications, the predictive model would become more nuanced and reflective of the multifaceted nature of Formula 1 racing, leading to more accurate and meaningful insights.

Closing Remarks

Concluding this report on the 2024 F1 Bahrain Grand Prix, we underscore the pivotal role of data visualization in demystifying the complexities of Formula 1 for fans and analysts alike. The success of these visual tools in portraying the intricacies of race strategy and performance sets a high standard for future reports.

As we move forward, each report will embody an iterative design philosophy, where feedback and new insights lead to continual refinements. This process ensures that our data analysis models become increasingly sophisticated, offering more nuanced understandings of the dynamics at play in each race.

The performance of Max Verstappen and Red Bull in Bahrain hints at a season where data-driven insights will be crucial for unpacking the championship narrative. As such, future reports will not only be more involved but also refined and improved, reflecting the evolving landscape of Formula 1 and the deepening competition.

In essence, the journey of enhancing our analytical capabilities is parallel to the unfolding drama of the F1 season itself. Each Grand Prix provides fresh data and new stories, fueling the cycle of improvement that promises to make each subsequent report an even richer source of insights into the sport we love.

Discover more from Drive Data Dynamics

Subscribe now to keep reading and get access to the full archive.

Continue reading