Hello! It's great to hear that you are interested in developing a model that combines image-to-text extraction with NLP to automate librarian tasks such as cataloging, abstracting, and indexing, using text extracted from books. This is a fascinating project with practical applications in library science.
If you are looking for someone to help you with this project, consider reaching out to professionals or researchers in machine learning, computer vision, and natural language processing. You can also approach academic institutions or research labs that specialize in these areas.
If you are considering developing this project for your thesis, it's important to outline your objectives, methodology, and expected outcomes clearly. You may also need to conduct a literature review to understand the existing research in this area and identify the gaps you aim to address with your project.
In terms of implementation, you can consider a deep learning framework such as TensorFlow or PyTorch for training models, OpenCV for image processing, and a library such as NLTK or spaCy for the natural language processing tasks.
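To make the indexing and abstracting steps more concrete, here is a minimal sketch of what they could look like once the text has already been extracted from a page image. It uses only the Python standard library as a stand-in for NLTK or spaCy, and everything in it is an assumption for illustration: the function names (`index_terms`, `extractive_abstract`), the tiny stopword list, and the simple term-frequency scoring are hypothetical choices, not a recommended final design.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption); NLTK and spaCy ship fuller ones.
STOPWORDS = {
    "the", "a", "an", "and", "of", "in", "to", "is", "for", "on",
    "with", "by", "this", "that", "it", "as", "are", "from",
}


def tokenize(text):
    """Lowercase the text and return its alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def term_counts(text):
    """Count tokens that are not stopwords and are longer than two letters."""
    return Counter(t for t in tokenize(text) if t not in STOPWORDS and len(t) > 2)


def index_terms(text, top_n=5):
    """Return the top_n most frequent terms as candidate index entries."""
    return [term for term, _ in term_counts(text).most_common(top_n)]


def extractive_abstract(text, max_sentences=1):
    """Pick the highest-scoring sentences and return them in original order.

    A sentence's score is the sum of the document-wide counts of its terms,
    so this naive scoring favors longer sentences.
    """
    counts = term_counts(text)
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    scored = [(sum(counts[t] for t in tokenize(s)), i) for i, s in enumerate(sentences)]
    best = sorted(scored, key=lambda pair: (-pair[0], pair[1]))[:max_sentences]
    keep = sorted(i for _, i in best)
    return " ".join(sentences[i] for i in keep)


# Hypothetical text, standing in for the output of the image-to-text step.
page_text = (
    "Library cataloging systems describe books. "
    "Cataloging rules help library staff index books. "
    "Library users search the index."
)

print(index_terms(page_text, top_n=4))
print(extractive_abstract(page_text))
```

In a real system, you would likely replace the tokenizer and stopword list with the ones from NLTK or spaCy, and replace the frequency scoring with a stronger method, but the overall shape (extracted text in, index terms and a short abstract out) stays the same.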
If you have any specific questions or need guidance on how to get started with your project, feel free to ask. Good luck with your thesis and project!