Main Article Content
Abstract
To solve the issue of the digital transformation of Chinese manufacturing in terms of the bottleneck between industrial interfaces not being able to adapt to heterogeneous operators and the high cognitive load imposed on them, the authors propose the SkeuoUI-Gen framework based on the adaptation of skeuomorphic design principles and the use of conditional diffusion models to produce personalized industrial interfaces in the context of Chinese manufacturing. In this regard, the experiment used a within-subjects design involving 250 manufacturing industry operators (diverse in age, experience, and industry sectors) to evaluate three interface types: traditional flat interfaces, fixed skeuomorphic interfaces, and personalized adaptation interfaces. The experiment used objective evaluations (FID and PSNR) and subjective evaluations (SUS score and cognitive load), and trained the model on multiple sources: 50,000 interaction logs from operators and 50,000 screenshots of industrial user interfaces. The experiment found that the personalized adaptation interface resulted in a 78.6% SUS score (an increase of 15.4% compared to the traditional baseline), improved efficiency by 24.7%, and reduced serious safety-related errors by 52% and 67%. The network achieved a lower FID (21.5) than GAN-based approaches and required only 2.3 seconds per generation. In addition, the network presented robustness through multi-dimensional validation. This framework expands the cognitive load theory and the technology acceptance model.
Keywords
Article Details
References
- Rombach, R., Blattmann, A., Lorenz, D., Esser, P., & Ommer, B. (2022). High-resolution image synthesis with latent diffusion models. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 10684-10695. https://doi.org/10.1109/CVPR52688.2022.01042
- Park, S.-G., Quan, W., Jin, H., Kim, S., & Park, J. (2023). UI layout generation with LLMs guided by UI grammar. arXiv preprint, arXiv:2310.15455. https://arxiv.org/abs/2310.15455
- Huang, F., Cai, G., Chen, X. L., He, G., Cheng, Z., Yang, J., & Yao, W. (2024). PosterLlama: Bridging design ability of language models to contents-aware layout generation. arXiv preprint, arXiv:2406.02884. https://arxiv.org/abs/2406.02884
- Mourtzis, D., Angelopoulos, J., & Panopoulos, N. (2023). The future of the human–machine interface (HMI) in society 5.0. Future Internet, 15(5), 162.https://doi.org/10.3390/fi15050162
- Carrera-Rivera, A., Larrinaga, F., Lasa, G., Martinez-Arellano, G., & Unamuno, G. (2022). How-to conduct a systematic literature review: A quick guide for computer science research. MethodsX, 9, Article 101895. https://doi.org/10.1016/j.mex.2022.101895
- Cheng, Z., Liang, J., Bai, X., Yang, X., & Xu, H. (2024). Play to your strengths: Collaborative intelligence of conventional machine learning and large language models. arXiv preprint, arXiv:2411.09753. https://arxiv.org/abs/2411.09753
- Song, X., & Shao, Y. (2024). A comparative analysis of user interface design in foreign language intelligent tutoring systems: Effectiveness, user experience, and pedagogical implications. PeerJ Computer Science, 10, e2336. https://doi.org/10.7717/peerj-cs.2336
- Bai, Y., Kadavath, S., Kundu, S., Askell, A., Kernion, J., Jones, A., ... & Kaplan, J. (2022). Constitutional AI: Harmlessness from AI feedback. arXiv preprint, arXiv:2212.08073. https://arxiv.org/abs/2212.08073
- Niu, H., Li, Y., Tang, F., Zhou, H., Chen, H., Chen, Y., ... & Wang, J. (2023). Parameter-efficient long-tailed recognition. arXiv preprint, arXiv:2309.10019. https://arxiv.org/abs/2309.10019
- Alves, T., Lima, M. A., & Gaspar, L. (2024). GenAI Chatbot User Interfaces. In Extended Abstracts of the 2024 CHI Conference on Human Factors in Computing Systems (CHI EA '24), Article 393, 1-5. https://doi.org/10.1145/3613905.3650940
- Ragin, C. C. (2008). Redesigning social inquiry: Fuzzy sets and beyond. University of Chicago Press. https://doi.org/10.7208/chicago/9780226702797.001.0001
- Ho, J., Jain, A., & Abbeel, P. (2020). Denoising diffusion probabilistic models. Advances in Neural Information Processing Systems, 33, 6840-6851. https://arxiv.org/abs/2006.11239
- Nielsen, J. (1994). Usability engineering. Morgan Kaufmann Publishers. https://doi.org/10.1016/C2009-0-21512-1
- Norman, D. A. (2013). The design of everyday things: Revised and expanded edition. Basic Books. https://www.hachettebookgroup.com/titles/don-norman/the-design-of-everyday-things/9780465050659/
- Hassenzahl, M., & Tractinsky, N. (2006). User experience – A research agenda. Behaviour & Information Technology, 25(2), 91-97. https://doi.org/10.1080/01449290500330331
- Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., & Bengio, Y. (2014). Generative adversarial nets. Advances in Neural Information Processing Systems, 27, 2672-2680. https://arxiv.org/abs/1406.2661
- Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems, 30, 5998-6008. https://arxiv.org/abs/1706.03762
- Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J. D., Dhariwal, P., ... & Amodei, D. (2020). Language models are few-shot learners. Advances in Neural Information Processing Systems, 33, 1877-1901. https://arxiv.org/abs/2005.14165
- Zhang, L., Rao, A., & Agrawala, M. (2023). Adding conditional control to text-to-image diffusion models. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 3813-3824. https://doi.org/10.1109/ICCV51070.2023.00355
- Cao, M., Wang, X., Qi, Z., Shan, Y., Qie, X., & Zheng, Y. (2024). Controllable generation with text-to-image diffusion models: A survey. arXiv preprint, arXiv:2403.04279. https://arxiv.org/abs/2403.04279
- Zhang, Z., Zhang, L., Jia, J., Zhong, Y., Deng, Y., Zhou, J., ... & Yu, J. (2023). A survey on text-to-image diffusion models. arXiv preprint, arXiv:2303.07909. https://arxiv.org/abs/2303.07909
- Zhao, S., Chen, D., Chen, Y. C., Bao, J., Hao, S., Yuan, L., & Wong, K. Y. K. (2023). Uni-ControlNet: All-in-one control to text-to-image diffusion models. Advances in Neural Information Processing Systems, 36, 55450-55470. https://arxiv.org/abs/2305.16322
- Brooke, J. (1996). SUS: A 'quick and dirty' usability scale. In P. W. Jordan, B. Thomas, B. A. Weerdmeester, & I. L. McClelland (Eds.), Usability evaluation in industry (pp. 189-194). Taylor & Francis. https://doi.org/10.1201/9781498710411-35
- Lewis, J. R., & Sauro, J. (2018). Item benchmarks for the System Usability Scale. Journal of Usability Studies, 13(3), 158-167. https://uxpajournal.org/item-benchmarks-system-usability-scale-sus/
- International Organization for Standardization. (2018). ISO 9241-11:2018 Ergonomics of human-system interaction — Part 11: Usability: Definitions and concepts. ISO. https://www.iso.org/standard/63500.html
- International Organization for Standardization. (2019). ISO 9241-210:2019 Ergonomics of human-system interaction — Part 210: Human-centred design for interactive systems. ISO. https://www.iso.org/standard/77520.html
- Marcus, A., & Gould, E. W. (2000). Crosscurrents: Cultural dimensions and global web user-interface design. Interactions, 7(4), 32-46. https://doi.org/10.1145/345190.345238
- Alsswey, A., & Al-Samarraie, H. (2021). The role of Hofstede's cultural dimensions in the design of user interface: The case of Arabic. Artificial Intelligence for Engineering Design, Analysis and Manufacturing, 35(1), 116-127. https://doi.org/10.1017/S0890060421000019
- Hofstede, G. (2011). Dimensionalizing cultures: The Hofstede model in context. Online Readings in Psychology and Culture, 2(1), Article 8. https://doi.org/10.9707/2307-0919.1014
- Cao, J., Zheng, L., Jiao, R. J., Li, K. W., & Xie, D. (2025). VR interface features for mental workload in manufacturing contexts: An integrated EEG-based approach. Journal of Intelligent Manufacturing. https://doi.org/10.1007/s10845-025-02452-0
- Villani, V., Pini, F., Leali, F., Secchi, C., & Fantuzzi, C. (2022). Towards adaptive interaction systems for inclusive workplaces: An Industry 5.0 perspective. IEEE Transactions on Automation Science and Engineering, 19(4), 3300-3310. https://doi.org/10.1109/TASE.2021.3117474
- Zhu, Y., Romain, C., Maguire, M., & Wang, J. (2024). Human-AI collaboration in smart manufacturing: A systematic review. Advanced Engineering Informatics, 62, Article 102738. https://doi.org/10.1016/j.aei.2024.102738
- Liu, X., Li, W., & Chen, Y. (2024). Multiple pathways to digital transformation in manufacturing enterprises: An analysis based on TOE framework and fsQCA method. PLOS ONE, 19(10), e0315249. https://doi.org/10.1371/journal.pone.0315249
- Li, Y., & Zhao, M. (2024). Intelligent manufacturing innovation and enterprise performance: Configuration analysis based on TOE framework. PLOS ONE, 19(9), e0309784. https://doi.org/10.1371/journal.pone.0309784
- Fan, D., Zhou, Y., Li, Y., & Wang, Z. (2023). How do manufacturing enterprises construct sustainable competitive advantage? A multi-configuration analysis based on fsQCA and NCA. Sustainability, 15(1), Article 542. https://doi.org/10.3390/su15010542
References
Rombach, R., Blattmann, A., Lorenz, D., Esser, P., & Ommer, B. (2022). High-resolution image synthesis with latent diffusion models. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 10684-10695. https://doi.org/10.1109/CVPR52688.2022.01042
Park, S.-G., Quan, W., Jin, H., Kim, S., & Park, J. (2023). UI layout generation with LLMs guided by UI grammar. arXiv preprint, arXiv:2310.15455. https://arxiv.org/abs/2310.15455
Huang, F., Cai, G., Chen, X. L., He, G., Cheng, Z., Yang, J., & Yao, W. (2024). PosterLlama: Bridging design ability of language models to contents-aware layout generation. arXiv preprint, arXiv:2406.02884. https://arxiv.org/abs/2406.02884
Mourtzis, D., Angelopoulos, J., & Panopoulos, N. (2023). The future of the human–machine interface (HMI) in society 5.0. Future Internet, 15(5), 162.https://doi.org/10.3390/fi15050162
Carrera-Rivera, A., Larrinaga, F., Lasa, G., Martinez-Arellano, G., & Unamuno, G. (2022). How-to conduct a systematic literature review: A quick guide for computer science research. MethodsX, 9, Article 101895. https://doi.org/10.1016/j.mex.2022.101895
Cheng, Z., Liang, J., Bai, X., Yang, X., & Xu, H. (2024). Play to your strengths: Collaborative intelligence of conventional machine learning and large language models. arXiv preprint, arXiv:2411.09753. https://arxiv.org/abs/2411.09753
Song, X., & Shao, Y. (2024). A comparative analysis of user interface design in foreign language intelligent tutoring systems: Effectiveness, user experience, and pedagogical implications. PeerJ Computer Science, 10, e2336. https://doi.org/10.7717/peerj-cs.2336
Bai, Y., Kadavath, S., Kundu, S., Askell, A., Kernion, J., Jones, A., ... & Kaplan, J. (2022). Constitutional AI: Harmlessness from AI feedback. arXiv preprint, arXiv:2212.08073. https://arxiv.org/abs/2212.08073
Niu, H., Li, Y., Tang, F., Zhou, H., Chen, H., Chen, Y., ... & Wang, J. (2023). Parameter-efficient long-tailed recognition. arXiv preprint, arXiv:2309.10019. https://arxiv.org/abs/2309.10019
Alves, T., Lima, M. A., & Gaspar, L. (2024). GenAI Chatbot User Interfaces. In Extended Abstracts of the 2024 CHI Conference on Human Factors in Computing Systems (CHI EA '24), Article 393, 1-5. https://doi.org/10.1145/3613905.3650940
Ragin, C. C. (2008). Redesigning social inquiry: Fuzzy sets and beyond. University of Chicago Press. https://doi.org/10.7208/chicago/9780226702797.001.0001
Ho, J., Jain, A., & Abbeel, P. (2020). Denoising diffusion probabilistic models. Advances in Neural Information Processing Systems, 33, 6840-6851. https://arxiv.org/abs/2006.11239
Nielsen, J. (1994). Usability engineering. Morgan Kaufmann Publishers. https://doi.org/10.1016/C2009-0-21512-1
Norman, D. A. (2013). The design of everyday things: Revised and expanded edition. Basic Books. https://www.hachettebookgroup.com/titles/don-norman/the-design-of-everyday-things/9780465050659/
Hassenzahl, M., & Tractinsky, N. (2006). User experience – A research agenda. Behaviour & Information Technology, 25(2), 91-97. https://doi.org/10.1080/01449290500330331
Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., & Bengio, Y. (2014). Generative adversarial nets. Advances in Neural Information Processing Systems, 27, 2672-2680. https://arxiv.org/abs/1406.2661
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems, 30, 5998-6008. https://arxiv.org/abs/1706.03762
Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J. D., Dhariwal, P., ... & Amodei, D. (2020). Language models are few-shot learners. Advances in Neural Information Processing Systems, 33, 1877-1901. https://arxiv.org/abs/2005.14165
Zhang, L., Rao, A., & Agrawala, M. (2023). Adding conditional control to text-to-image diffusion models. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 3813-3824. https://doi.org/10.1109/ICCV51070.2023.00355
Cao, M., Wang, X., Qi, Z., Shan, Y., Qie, X., & Zheng, Y. (2024). Controllable generation with text-to-image diffusion models: A survey. arXiv preprint, arXiv:2403.04279. https://arxiv.org/abs/2403.04279
Zhang, Z., Zhang, L., Jia, J., Zhong, Y., Deng, Y., Zhou, J., ... & Yu, J. (2023). A survey on text-to-image diffusion models. arXiv preprint, arXiv:2303.07909. https://arxiv.org/abs/2303.07909
Zhao, S., Chen, D., Chen, Y. C., Bao, J., Hao, S., Yuan, L., & Wong, K. Y. K. (2023). Uni-ControlNet: All-in-one control to text-to-image diffusion models. Advances in Neural Information Processing Systems, 36, 55450-55470. https://arxiv.org/abs/2305.16322
Brooke, J. (1996). SUS: A 'quick and dirty' usability scale. In P. W. Jordan, B. Thomas, B. A. Weerdmeester, & I. L. McClelland (Eds.), Usability evaluation in industry (pp. 189-194). Taylor & Francis. https://doi.org/10.1201/9781498710411-35
Lewis, J. R., & Sauro, J. (2018). Item benchmarks for the System Usability Scale. Journal of Usability Studies, 13(3), 158-167. https://uxpajournal.org/item-benchmarks-system-usability-scale-sus/
International Organization for Standardization. (2018). ISO 9241-11:2018 Ergonomics of human-system interaction — Part 11: Usability: Definitions and concepts. ISO. https://www.iso.org/standard/63500.html
International Organization for Standardization. (2019). ISO 9241-210:2019 Ergonomics of human-system interaction — Part 210: Human-centred design for interactive systems. ISO. https://www.iso.org/standard/77520.html
Marcus, A., & Gould, E. W. (2000). Crosscurrents: Cultural dimensions and global web user-interface design. Interactions, 7(4), 32-46. https://doi.org/10.1145/345190.345238
Alsswey, A., & Al-Samarraie, H. (2021). The role of Hofstede's cultural dimensions in the design of user interface: The case of Arabic. Artificial Intelligence for Engineering Design, Analysis and Manufacturing, 35(1), 116-127. https://doi.org/10.1017/S0890060421000019
Hofstede, G. (2011). Dimensionalizing cultures: The Hofstede model in context. Online Readings in Psychology and Culture, 2(1), Article 8. https://doi.org/10.9707/2307-0919.1014
Cao, J., Zheng, L., Jiao, R. J., Li, K. W., & Xie, D. (2025). VR interface features for mental workload in manufacturing contexts: An integrated EEG-based approach. Journal of Intelligent Manufacturing. https://doi.org/10.1007/s10845-025-02452-0
Villani, V., Pini, F., Leali, F., Secchi, C., & Fantuzzi, C. (2022). Towards adaptive interaction systems for inclusive workplaces: An Industry 5.0 perspective. IEEE Transactions on Automation Science and Engineering, 19(4), 3300-3310. https://doi.org/10.1109/TASE.2021.3117474
Zhu, Y., Romain, C., Maguire, M., & Wang, J. (2024). Human-AI collaboration in smart manufacturing: A systematic review. Advanced Engineering Informatics, 62, Article 102738. https://doi.org/10.1016/j.aei.2024.102738
Liu, X., Li, W., & Chen, Y. (2024). Multiple pathways to digital transformation in manufacturing enterprises: An analysis based on TOE framework and fsQCA method. PLOS ONE, 19(10), e0315249. https://doi.org/10.1371/journal.pone.0315249
Li, Y., & Zhao, M. (2024). Intelligent manufacturing innovation and enterprise performance: Configuration analysis based on TOE framework. PLOS ONE, 19(9), e0309784. https://doi.org/10.1371/journal.pone.0309784
Fan, D., Zhou, Y., Li, Y., & Wang, Z. (2023). How do manufacturing enterprises construct sustainable competitive advantage? A multi-configuration analysis based on fsQCA and NCA. Sustainability, 15(1), Article 542. https://doi.org/10.3390/su15010542