May 10th, 2022

The H2O.ai Wildfire Challenge Winners Blog Series – Team Too Hot Encoder

RSS icon RSS Category: AI4Good, Community

Note: this is a community blog post by Team Too Hot Encoder – one of the H2O.ai Wildfire Challenge winners. You can check out their app here.

The Challenge

The aim of the project is to predict the probability of wildfire occurrence in Turkey for each month in 2020. As a result of these predictions, it is aimed to carry out more intensive monitoring studies in possible fire areas and to respond to fires very soon after they start. It is also aimed to derive generalizable relations by interpreting the model outputs and the importance attributed to each variable used by the model.

The Motivation

The wildfires in Turkey started in August 2021, spread over very large areas and resulted in the destruction of large areas and living things due to lack of intervention, have created a big agenda throughout the country. The public and politicians often complained about this technical inadequacy and suggested that improvements should be made in this regard. Within the scope of the project, it was desired to see whether an estimation could be made on this subject throughout the country, and if so, how successful the results would be.

The Goal

The goal of the project is to estimate the probability of a wildfire occurrence for each month of 2020 for each grid segment by dividing the area of Turkey in latitude and longitude with 1 degree precision. It is defined in the H2O Competition Overview as “Predicting the behavior of wildfires”.

Potential Audience

Since the probability of wildfire occurrence in certain areas in the future is being calculated, the following groups and individuals can benefit from this project:

  • Firefighters: Fire departments can keep firefighter density higher in risky areas, this way faster response can be provided in case of a wildfire.
  • Municipal Administrative Staff: The municipality administration can take protective and prohibitive precautions for various areas. Thus, the loss of life and property is minimized.
  • Civil Society Organizations (CSOs): Civil society organizations can find the opportunity to strengthen their networks in advance to collect aid in possible disaster situations in risky areas.

The Team

Anıl Öztürk entered the competition alone. Anıl is a Machine Learning Engineer with a Master’s Degree in Computer Engineering from Istanbul, Turkey. He has mostly worked on tabular data, deep learning and deep reinforcement learning. He is passionate about following state-of-the-art, competing in Kaggle and whining about stochasticity. He is trying to gain experience in different domains by participating in local and global competitions as much as possible. This competition was also very interesting for Anil because he had no experience with geospatial data.

The Methodology

Historical active fire and temperature observations are used as features. LightGBM (an advanced decision tree algorithm) was used in the project. The following factors were effective in choosing this algorithm:

  • Decision tree algorithms generally give better results than other statistical algorithms for tabular data. They are among the first algorithms to be tried.
  • LightGBM is often faster to train than similar decision-tree-based algorithms.
  • It has a library that makes it easy to use. Translation and transfer to different programming languages can be done.
  • It contains various evaluation and analysis methods. Thus, when the user wants to measure model performance, s/he does not have to write code from scratch or search for it.

The App

Within the scope of the competition, an interactive web application was requested. I designed an interface where users can make all the analyses and adjustments to the model. I tried to use self-explanatory visualizations whenever possible. Users can access non-technical details of the project, dataset explanations, dataset analysis graphics, model predictions and visualizations of that predictions, model evaluation and metrics screen from within the application.

The Key Takeaways

When I examined the feedback after the first-submission stage, I saw that the accuracy was one of the parts that had a low impact on the score. All scores and feedback statements were focused on usability, simplicity, and explainability. That’s why I paid the most attention to the following during the competition:

  • The application is prepared with the ease of download&run (Unfortunately, there were problems when a few libraries used had critical changes with their updates during the evaluation period.)
  • Non-technical personnel should be able to use the application easily, and the application should be easily deployed.
  • The app should be easy to navigate.
  • The application should hold the user’s hand as much as possible, and the explanation of even the things that a data scientist can easily understand should be added with the non-technical user in mind.
  • The application should also present critical metrics that technical staff will understand.

This competition has been very beneficial for me in terms of the mentality of considering corporate and stakeholder requirements while designing a solution. In most competitions, these real-life requirements can be overlooked when trying to maximize a score metric.

About the Author

H2O.ai Team

At H2O.ai, democratizing AI isn’t just an idea. It’s a movement. And that means that it requires action. We started out as a group of like minded individuals in the open source community, collectively driven by the idea that there should be freedom around the creation and use of AI.

Today we have evolved into a global company built by people from a variety of different backgrounds and skill sets, all driven to be part of something greater than ourselves. Our partnerships now extend beyond the open-source community to include business customers, academia, and non-profit organizations.

Leave a Reply

+
Enhancing H2O Model Validation App with h2oGPT Integration

As machine learning practitioners, we’re always on the lookout for innovative ways to streamline and

May 17, 2023 - by Parul Pandey
+
Building a Manufacturing Product Defect Classification Model and Application using H2O Hydrogen Torch, H2O MLOps, and H2O Wave

Primary Authors: Nishaanthini Gnanavel and Genevieve Richards Effective product quality control is of utmost importance in

May 15, 2023 - by Shivam Bansal
AI for Good hackathon
+
Insights from AI for Good Hackathon: Using Machine Learning to Tackle Pollution

At H2O.ai, we believe technology can be a force for good, and we're committed to

May 10, 2023 - by Parul Pandey and Shivam Bansal
H2O democratizing LLMs
+
Democratization of LLMs

Every organization needs to own its GPT as simply as we need to own our

May 8, 2023 - by Sri Ambati
h2oGPT blog header
+
Building the World’s Best Open-Source Large Language Model: H2O.ai’s Journey

At H2O.ai, we pride ourselves on developing world-class Machine Learning, Deep Learning, and AI platforms.

May 3, 2023 - by Arno Candel
LLM blog header
+
Effortless Fine-Tuning of Large Language Models with Open-Source H2O LLM Studio

While the pace at which Large Language Models (LLMs) have been driving breakthroughs is remarkable,

May 1, 2023 - by Parul Pandey

Request a Demo

Explore how to Make, Operate and Innovate with the H2O AI Cloud today

Learn More