Small sample data pricing research based on Reptile algorithm
DOI: https://doi.org/10.3846/jbem.2025.23768Abstract
Reasonably pricing data resources contributes to the development of the digital economy. However, in the early stage of the development of the data market, the data transaction volume is small, and the resulting small sample problem makes it difficult to accurately model and forecast the price of data. To address this issue, this paper constructs a meta-learning-based data price prediction model: MLP-Reptile. The model introduces a meta-learning tuning module to optimize the weight parameters of the base model, facilitating effective knowledge transfer learned from multiple tasks to enhance prediction accuracy in new tasks. Experimental results demonstrate that the proposed MLP-Reptile model excels in small sample data pricing tasks, outperforming other models. Additionally, the paper analyzes the different primary factors influencing data prices in various industries. The methods proposed in this research are universally applicable for addressing the small sample problem in data pricing, providing a reference for solving similar issues.
Keywords:
data pricing, small sample, bayesian optimization, meta-learning, reptile algorithm, digital economyHow to Cite
Share
License
Copyright (c) 2025 The Author(s). Published by Vilnius Gediminas Technical University.
This work is licensed under a Creative Commons Attribution 4.0 International License.
References
Ali, S., Akhlaq, F., Imran, A. S., Kastrati, Z., Daudpota, S. M., & Moosa, M. (2023). The enlightening role of explainable artificial intelligence in medical & healthcare domains: A systematic literature review. Computers in Biology and Medicine, 166, Article 107555. https://doi.org/10.1016/j.compbiomed.2023.107555
Bjorck, N., Gomes, C. P., Selman, B., & Weinberger, K. Q. (2018). Understanding batch normalization. Advances in Neural Information Processing Systems, 31, 1–12.
Chen, F., Yan, J., Liu, Y., Yan, Y., & Tjernberg, L. B. (2024). A novel meta-learning approach for few-shot short-term wind power forecasting._ Applied Energy, 362, Article 122838. https://doi.org/10.1016/j.apenergy.2024.122838
Demetzou, K., Zanfir-Fortuna, G., & Barros Vale, S. (2023). The thin red line: Refocusing data protection law on ADM, a global perspective with lessons from case-law. Computer Law & Security Review, 49, Article 105806. https://doi.org/10.1016/j.clsr.2023.105806
Du, X., Li, X., Zhang, S., Zhao, T., Hou, Q., Jin, X., & Zhang, J. (2022). High-accuracy estimation method of typhoon storm surge disaster loss under small sample conditions by information diffusion model coupled with machine learning models. International Journal of Disaster Risk Reduction, 82, Article 103307. https://doi.org/10.1016/j.ijdrr.2022.103307
Finn, C., Abbeel, P., & Levine, S. (2017). Model-agnostic meta-learning for fast adaptation of deep networks. Proceedings of the 34th International Conference on Machine Learning, 70, 1126–1135.
Ge, J., Tang, H., Yang, N., & Hu, Y. (2023). Rapid identification of damaged buildings using incremental learning with transferred data from historical natural disaster cases. ISPRS Journal of Photogrammetry and Remote Sensing, 195, 105–128. https://doi.org/10.1016/j.isprsjprs.2022.11.010
Guoxin Youyi (n.d.). https://infobase.deeprocknetzero.com/detail/449?
Guo, Z., Hao, J., & Kennedy, L. (2024). Protection path of personal data and privacy in China: Moving from monism to dualism in civil law and then in criminal law. Computer Law & Security Review, 52, Article 105928. https://doi.org/10.1016/j.clsr.2023.105928
Hao, J., Deng, Z., & Li, J. (2023a). The evolution of data pricing: From economics to computational intelligence. Heliyon, 9(9), Article e20274. https://doi.org/10.1016/j.heliyon.2023.e20274
Hao, J., Yuan, J., Li, J., Liu, M., & Liu, Y. (2023b). Ensemble pricing model for data assets with ranking-pruning-averaging strategy. Procedia Computer Science, 221, 813–820. https://doi.org/10.1016/j.procs.2023.08.055
Hoerl, A. E., & Kennard, R. W. (1970). Ridge regression: Biased estimation for nonorthogonal problems. Technometrics, 12(1), 55–67. https://doi.org/10.1080/00401706.1970.10488634
Ioffe, S., & Szegedy, C. (2015). Batch normalization: Accelerating deep network training by reducing internal covariate shift. Proceedings of the 32nd International Conference on International Conference on Machine Learning, 37, 448–456.
Jia, W., Collins, A., & Liu, W. (2023). Digitalization and economic growth in the new classical and new structural economics perspectives. Digital Economy and Sustainable Development, 1, Article 5. https://doi.org/10.1007/s44265-023-00007-0
JD Cloud. (n.d.). JD Wanxiang Data Platform. https://wx.jdcloud.com/
Liu, J. (2021). Development practice of data factor market in China: Achievements, problems and suggestions. Price: Theory & Practice, (12), 18–22.
Lee, J., & Yang, C. (2022). Deep neural network and meta-learning-based reactive sputtering with. Journal of Manufacturing Systems, 62, 703–717. https://doi.org/10.1016/j.jmsy.2022.02.004
Li, L., Fan, Y., Tse, M., & Lin, K.-Y. (2020). A review of applications in federated learning. Computers & Industrial Engineering, 149, Article 106854. https://doi.org/10.1016/j.cie.2020.106854
Li, X., Chen, K., Yang, J., Wang, C., Yang, T., Luo, C., Li, N., & Liu, Z. (2023). TLDA: A transfer learning based dual-augmentation strategy for traditional Chinese Medicine syndrome differentiation in rare disease. Computers in Biology and Medicine, 169, Article 107808. https://doi.org/10.1016/j.compbiomed.2023.107808
Li, X., Su, H., Xiang, L., Yao, Q., & Hu, A. (2024). Transformer-based meta learning method for bearing fault identification under multiple small sample conditions. Mechanical Systems and Signal Processing, 208, Article 110967. https://doi.org/10.1016/j.ymssp.2023.110967
Liao, J., & Li, R. (2023). Establishing a two-way transaction pricing model of “platform-individual” co-creation data property rights. Journal of Innovation & Knowledge, 8(4), Article 100427. https://doi.org/10.1016/j.jik.2023.100427
Lundberg, S. M., & Lee, S.-I. (2017). A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems, 30, 1–10.
Lv, Z., Qiao, L., Hossain, M., & Choi, B. (2021). Analysis of using blockchain to protect the privacy of drone big data. IEEE Network, 35(1), 44–49. https://doi.org/10.1109/MNET.011.2000154
Miao, X., Gao, Y., Chen, L., Peng, H., Yin, J., & Li, Q. (2021, April 19–22). Towards query pricing on incomplete data. In Proceedings of the 2021 IEEE 37th International Conference on Data Engineering (ICDE) (pp. 2348–2349). Chania, Greece. IEEE. https://doi.org/10.1109/ICDE51399.2021.00260
Minot, M., & Reddy, S. T. (2024). Meta learning addresses noisy and under-labeled data in machine learning-guided antibody engineering. Cell Systems, 15(1), 4–18. https://doi.org/10.1016/j.cels.2023.12.003
National Standards Committee. (2018, June 07). Information technology Data Transaction Service transaction data Description (Standard No. GB/T 36343-2018). https://openstd.samr.gov.cn/bzgk/gb/newGbInfo?hcno=445371F13A0DED107EF13579D018ACEA
Ng, A. Y. (2004). Feature selection, L1 vs. L2 regularization, and rotational invariance. In Proceedings of the Twenty-first International Conference on Machine Learning (p. 78). Association for Computing Machinery. https://doi.org/10.1145/1015330.1015435
Nichol, A., Achiam, J., & Schulman, J. (2018). On first-order meta-learning algorithms. Arxiv. https://doi.org/10.48550/arXiv.1803.02999
Niyato, D., Alsheikh, M. A., Wang, P., Kim, D. I., & Han, Z. (2016 May 22–27). Market model and optimal pricing scheme of big data and Internet of Things (IoT). In 2016 IEEE International Conference on Communications (ICC) (pp. 1–6). Kuala Lumpur, Malaysia. IEEE. https://doi.org/10.1109/ICC.2016.7510922
Paeedeh, N., Pratama, M., Ma’sum, M. A., Mayer, W., Cao, Z., & Kowlczyk, R. (2024). Cross-domain few-shot learning via adaptive transformer networks. Knowledge-Based Systems, 288, Article 111458. https://doi.org/10.1016/j.knosys.2024.111458
Ribeiro, M. T., Singh, S., & Guestrin, C. (2016). “Why should I trust you?”: Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 1135–1144). Association for Computing Machinery. https://doi.org/10.1145/2939672.2939778
Rong, K. (2022). Research agenda for the digital economy. Journal of Digital Economy, 1(1), 20–31. https://doi.org/10.1016/j.jdec.2022.08.004
Santurkar, S., Tsipras, D., Ilyas, A., & Madry, A. (2018). How does batch normalization help optimization? In Proceedings of the 32nd International Conference of Neural Information Processing Systems (pp. 2488–2498). Curran Associates Inc.
Sun, J., Wei, D., Wang, L., & Zheng, Y. (2024). Hybrid unsupervised representation learning and pseudo-label supervised self-distillation for rare disease imaging phenotype classification with dispersion-aware imbalance correction. Medical Image Analysis, 93, Article 103102. https://doi.org/10.1016/j.media.2024.103102
Tian, C., Zhu, X., Hu, Z., & Ma, J. (2021). A transfer approach with attention reptile method and long-term generation mechanism for few-shot traffic prediction. Neurocomputing, 452, 15–27. https://doi.org/10.1016/j.neucom.2021.03.068
Tian, G., & Wu, W. (2023). Big data pricing in marketplace lending and price discrimination against repeat borrowers: Evidence from China. China Economic Review, 78, Article 101944. https://doi.org/10.1016/j.chieco.2023.101944
Tianyuan Data. (n.d.). https://www.tdata.cn/
Vo, S.-N., Vo, T.-T., & Le, B. (2024). Interpretable extractive text summarization with meta-learning and BI-LSTM: A study of meta learning and explainability techniques. Expert Systems with Applications, 245, Article 123045. https://doi.org/10.1016/j.eswa.2023.123045
Weng, Y., & Paal, S. G. (2024). Extrapolating wind pressures on roof soffits of low-rise buildings using few-shot learning. Journal of Building Engineering, 82, Article 108110. https://doi.org/10.1016/j.jobe.2023.108110
Xu, J., Hong, N., Xu, Z., Zhao, Z., Wu, C., Kuang, K., Wang, J., Zhu, M., Zhou, J., Ren, K., Yang, X., Lu, C., Pei, J., & Shum, H. (2023). Data-driven learning for data rights, data pricing, and privacy computing. Engineering, 25, 66–76. https://doi.org/10.1016/j.eng.2022.12.008
Xu, L., Jiang, C., Qian, Y., Zhao, Y., Li, J. & Ren, Y., (2016). Dynamic privacy pricing: A multi-armed bandit approach with time-variant rewards. IEEE Transactions on Information Forensics and Security, 12(2), 271–285. https://doi.org/10.1109/TIFS.2016.2611487
Yanik, E., Intes, X., Kruger, U., Yan, P., Diller, D. Van Voorst, B., Makled, B., Norfleet, J., & De, S. (2022). Deep neural networks for the assessment of surgical skills: A systematic review. The Journal of Defense Modeling and Simulation, 19(2), 159–171. https://doi.org/10.1177/15485129211034586
Yanik, E., Schwaitzberg, S., Yang, G., Intes, X., Norfleet, J., Hackett, M., & De, S. (2024). One-shot skill assessment in high-stakes domains with limited data via meta learning. Computers in Biology and Medicine, 174, Article 108470. https://doi.org/10.1016/j.compbiomed.2024.108470
Ye, Y., Zhou, A., Shi, X., & Huang, C. (2022). A SEED model for constructing the data factor market: Evidence from Guiyang Global Big Data Exchange (GBDEx) in China. Journal of Digital Economy, 1(3), 273–283. https://doi.org/10.1016/j.jdec.2023.03.002
You, J., Lou, S., Mao, R., & Xu, T. (2022). An improved FMEA quality risk assessment framework for enterprise data assets. Journal of Digital Economy, 1(3), 141–152. https://doi.org/10.1016/j.jdec.2022.12.001
Yu, B., & Zhao, H. (2019). Research on the construction of big data trading platform in China. In Proceedings of the 2019 4th International Conference on Intelligent Information Technology (pp. 107–112). Association for Computing Machinery. https://doi.org/10.1145/3321454.3321474
Zhang, X., Hu, C., He, B., & Han, Z. (2022). Distributed reptile algorithm for meta-learning over multi-agent systems. IEEE Transactions on Signal Processing, 70, 5443–5456. https://doi.org/10.1109/TSP.2022.3222734
Zhao, H., Qiu, S., Bai, M., Wang, L., & Wang, Z. (2024). Toxicity prediction and classification of Gunqile-7 with small sample based on transfer learning method. Computers in Biology and Medicine, 173, Article 108348. https://doi.org/10.1016/j.compbiomed.2024.108348
View article in other formats
Published
Issue
Section
Copyright
Copyright (c) 2025 The Author(s). Published by Vilnius Gediminas Technical University.
License
This work is licensed under a Creative Commons Attribution 4.0 International License.