Research Article

VBNet: A Visually-Aware Biomimetic Network for Simulating the Human Eye’s Visual System

Article: 2335100 | Received 22 Oct 2023, Accepted 21 Mar 2024, Published online: 01 Apr 2024
