
Implementation
We evaluated multiple options for the AI generated icons we needed. Our main criteria
when looking for AI Tools here, was that it had to meet our needs for visual consistency,
control, and eciency. It also had to match the style of the preexisting Wallet OIM Icons, so
that it would not break the design style of the UI. Furthermore, the icons were being made
for a innumerate and illiterate audience, so the icons also could not be overly abstract or
artistic. Though the process was mostly smooth, there were some issues associated with
asset generation. Firstly, Gemini would quite frequently not understand the instructions, or
deviate from the style, which required us to rephrase and increase our iterations, to get it to
within a reasonable style. This was denitely a frustrating challenge to deal with. Another
challenge was that certain icons required multiple revision cycles. That is, we had to start
new conversations with Gemini, to clear the past context and start anew because the one
conversation was simply not producing acceptable results.
Firstly, we considered some specic image generation tools, such as Midjourney DALL
E 3, and Ideogram. However, there were signicant issues with these tools, namely, they
struggled to either get down the at, minimalist iconography style we wanted to preserve
from the pre-existing icons we were given by our partners. Or, they were inconsistent and
provided limited reproducibility. Next, we considered a directly integrated tool , which was
the Figma AI Plugin tool. We had experience using Figma, and this was integrated and
fast to use. However, it oî™»ered very little customization, and thus we could not control it
to produce acceptable icons. We nally decided to use Google Gemini after some test-
ing. We found that Gemini balanced control and creativity together the best. Additionally,
Gemini’s generated images were completely free for commercial use. Although it had a
weakness with stylization drift occurring, it was still the best tool we found for the task. Our
main workow for generating the icons was that we would pick a preexisting icon from the
codebase, and use it as a source of ground truth for Gemini to base its style oî™» of. Then,
we would ask it to generate a a version with our desired changes to the icons, and we
would iterate using conversation, until we arrived at something we wanted to keep. This
required multiple back and forths with Gemini to get a icon we desired. The workow for
a icon, took about 12.4 minutes. In addition to Gemini, we used Freepix on a limited ba-
sis (which does not use prompts) for generating new background tables (reducing size
by 6MB). Icons generated with Freepix are marked with ‘Freepix’; all other assets were
generated using Gemini, with the prompts available in Appendix .3.
Our AI-powered pipeline demonstrates signicant improvements in visual quality and
consistency across all icon categories. As shown in Appendix .1, the transformation from
legacy to AI-generateds exhibits enhanced clarity, addition of colours, and a more rened
styling with respect to older legacy code. The pipeline successfully enhances edge de-
nition and overall visual coherence while maintaining the core semantic meaning of each
icon. Across all comparisons, the AI-generated versions display superior line quality, aes-
thetic consistency, and a unied visual language that contributes to a more polished and
professional user interface. The complete icon gallery in Appendix ?? showcases both
the evolved icons and those created directly through the AI renement process. The con-
sistent improvement across these diverse icon types validates the eî™»ectiveness of our
2