“We collaborated for a short period during my stint at Belong. co and it was a great honor to know Harshit. He has a very positive attitude towards work. He is a really extraordinary person. He always goes out of his way when you ask for some help and guidance.”
About
Deep Learning, Computer Vision, Natural Language Processing, Text Mining, Information…
Activity
-
In light of recent tech industry changes affecting our AI/ML colleagues at Meta, I extend my support to those impacted. If you or your friends have…
In light of recent tech industry changes affecting our AI/ML colleagues at Meta, I extend my support to those impacted. If you or your friends have…
Liked by Harshit Pande
-
Thank you, Graphic Era Deemed to be University Alumni Association, for this post. It means a lot coming from my alma mater. 🫶🙏🏻❣️
Thank you, Graphic Era Deemed to be University Alumni Association, for this post. It means a lot coming from my alma mater. 🫶🙏🏻❣️
Liked by Harshit Pande
-
Simpson’s paradox: a treatment that looks superior may be inferior for EVERY slice. In this example, which I share in my advanced course…
Simpson’s paradox: a treatment that looks superior may be inferior for EVERY slice. In this example, which I share in my advanced course…
Liked by Harshit Pande
Experience
Education
Licenses & Certifications
Volunteer Experience
-
Educator
click2study
- Present 12 years 8 months
Education
Made freely available educational videos in a self-owned YouTube channel which as of December 2017 has roughly 2500 subscribers. The link to the channel: https://www.youtube.com/user/click2study/featured
Publications
-
Field-Embedded Factorization Machines for Click-through rate prediction
arXiv.org
See publicationClick-through rate (CTR) prediction models are common in many online applications such as digital advertising and recommender systems. Field-Aware Factorization Machine (FFM) and Field-weighted Factorization Machine (FwFM) are state-of-the-art among the shallow models for CTR prediction. Recently, many deep learning-based models have also been proposed. Among deeper models, DeepFM, xDeepFM, AutoInt+, and FiBiNet are state-of-the-art models. The deeper models combine a core architectural…
Click-through rate (CTR) prediction models are common in many online applications such as digital advertising and recommender systems. Field-Aware Factorization Machine (FFM) and Field-weighted Factorization Machine (FwFM) are state-of-the-art among the shallow models for CTR prediction. Recently, many deep learning-based models have also been proposed. Among deeper models, DeepFM, xDeepFM, AutoInt+, and FiBiNet are state-of-the-art models. The deeper models combine a core architectural component, which learns explicit feature interactions, with a deep neural network (DNN) component. We propose a novel shallow Field-Embedded Factorization Machine (FEFM) and its deep counterpart Deep Field-Embedded Factorization Machine (DeepFEFM). FEFM learns symmetric matrix embeddings for each field pair along with the usual single vector embeddings for each feature. FEFM has significantly lower model complexity than FFM and roughly the same complexity as FwFM. FEFM also has insightful mathematical properties about important fields and field interactions. DeepFEFM combines the FEFM interaction vectors learned by the FEFM component with a DNN and is thus able to learn higher order interactions. We conducted comprehensive experiments over a wide range of hyperparameters on two large publicly available real-world datasets. When comparing test AUC and log loss, the results show that FEFM and DeepFEFM outperform the existing state-of-the-art shallow and deep models for CTR prediction tasks.
-
Deep Learning for Weak Supervision of Diabetic Retinopathy Abnormalities
2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019)
See publicationDeep learning-based grading of the fundus images of the retina is an active area of research. Various existing studies use different deep learning architectures on different datasets. Results of some of the studies could not be replicated in other studies. Thus a benchmarking study across multiple architectures spanning both classification and localization is needed. We present a comparative study of different state-of-the-art architectures trained on a proprietary dataset and tested on the…
Deep learning-based grading of the fundus images of the retina is an active area of research. Various existing studies use different deep learning architectures on different datasets. Results of some of the studies could not be replicated in other studies. Thus a benchmarking study across multiple architectures spanning both classification and localization is needed. We present a comparative study of different state-of-the-art architectures trained on a proprietary dataset and tested on the publicly available Messidor-2 dataset. Although evidence is of utmost importance in AI-based medical diagnosis, most studies limit themselves to the classification performance and do not report the quantification of the performance of the abnormalities localization. To alleviate this, using class activation maps, we also report a comparison of localization scores for different architectures. For classification, we found that as the number of parameters increase, the models perform better, with NASNet yielding highest accuracy and average precision, recall, and F1-scores of around 95%. For localization, VGG19 outperformed all the models with a mean Intersection over Minimum of 0.45. We also found that there is a trade-off between classification performance and localization performance. As the models get deeper, their receptive field increases, causing them to perform well on classification but underperform on the localization of fine-grained abnormalities.
-
Dynamic Region Proposal Networks For Semantic Segmentation In Automated Glaucoma Screening
2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019)
See publicationScreening for the diagnosis of glaucoma through a fundus image can be determined by the optic cup to disc diameter ratio (CDR), which requires the segmentation of the cup and disc regions. In this paper, we propose two novel approaches, namely Parameter-Shared Branched Network (PSBN) and Weak Region of Interest Model-based segmentation (WRoIM) to identify disc and cup boundaries. Unlike the previous approaches, the proposed methods are trained endto-end through a single neural network…
Screening for the diagnosis of glaucoma through a fundus image can be determined by the optic cup to disc diameter ratio (CDR), which requires the segmentation of the cup and disc regions. In this paper, we propose two novel approaches, namely Parameter-Shared Branched Network (PSBN) and Weak Region of Interest Model-based segmentation (WRoIM) to identify disc and cup boundaries. Unlike the previous approaches, the proposed methods are trained endto-end through a single neural network architecture and use dynamic cropping instead of manual or traditional computer vision-based cropping. We are able to achieve similar performance as that of state-of-the-art approaches with less number of network parameters. Our experiments include comparison with different best known methods on publicly available Drishti-GS1 and RIM-ONE v3 datasets. With 7.8 × 10 6 parameters our approach achieves a Dice score of 0.96/0.89 for disc/cup segmentation on Drishti-GS1 data whereas the existing state-of-the-art approach uses 19.8 × 10 6 parameters to achieve a dice score of 0.97/0.89.
-
Structures of Ubl and Hsp90-like domains of sacsin provide insight into pathological mutations
Journal of Biological Chemistry
See publicationAutosomal recessive spastic ataxia of Charlevoix-Saguenay (ARSACS) is a neurodegenerative disease that is caused by mutations in the SACS gene. The product of this gene is a very large 520 kDa cytoplasmic protein, sacsin, with an ubiquitin-like domain (Ubl) at the N-terminus followed by three large sacsin internal repeats (SIRPTs) supradomains and C-terminal J and HEPN domains. The SIRPTs are predicted to contain Hsp90-like domains suggesting a potential chaperone activity. In this work, we…
Autosomal recessive spastic ataxia of Charlevoix-Saguenay (ARSACS) is a neurodegenerative disease that is caused by mutations in the SACS gene. The product of this gene is a very large 520 kDa cytoplasmic protein, sacsin, with an ubiquitin-like domain (Ubl) at the N-terminus followed by three large sacsin internal repeats (SIRPTs) supradomains and C-terminal J and HEPN domains. The SIRPTs are predicted to contain Hsp90-like domains suggesting a potential chaperone activity. In this work, we report the structures of the Hsp90-like Sr1 domain of SIRPT1 and the N-terminal Ubl domain determined at 1.55 Å and 2.1 Å resolution, respectively. The Ubl domain crystallized as a swapped dimer that could be relevant in the context of full-length protein. The Sr1 domain displays the Bergerat protein fold with a characteristic nucleotide-binding pocket, though it binds nucleotides with very low affinity. The Sr1 structure reveals that ARSACS-causing missense mutations (R272H, R272C and T201K) disrupt protein folding most likely leading to sacsin degradation. This work lends a structural support to the view of sacsin as a molecular chaperone and provides a framework for future studies of this protein.
-
Improved Extraction of Objects from Urine Microscopy Images with Unsupervised Thresholding and Supervised U-net Techniques
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops
See publicationWe propose a novel unsupervised method for extracting objects from urine microscopy images and also applied U-net for extracting these objects. We fused these proposed methods with a known edge thresholding technique from an existing work on segmentation of urine microscopic images. Comparison between our proposed methods and the existing work showed that for certain object types the proposed unsupervised method with or without edge thresholding outperforms the other methods, while in other…
We propose a novel unsupervised method for extracting objects from urine microscopy images and also applied U-net for extracting these objects. We fused these proposed methods with a known edge thresholding technique from an existing work on segmentation of urine microscopic images. Comparison between our proposed methods and the existing work showed that for certain object types the proposed unsupervised method with or without edge thresholding outperforms the other methods, while in other cases the U-net method with or without edge thresholding outperforms the other methods. Overall the proposed unsupervised method along with edge thresholding worked the best by extracting maximum number of objects and minimum number of artifacts. On a test dataset, the artifact to object ratio for the proposed unsupervised method was 0.71, which is significantly better than that of 1.26 for the existing work. The proposed unsupervised method along with edge thresholding extracted 3208 objects as compared to 1608 by the existing work. To the best of our knowledge this is the first application of Deep Learning for extraction of clinically significant objects in urine microscopy images.
-
Study on the performance of an artificial intelligence system for image based analysis of urine samples
Journal of Clinical & Experimental Pathology
See publicationIn this study, we evaluate the performance of Shrava, a cloud based Artificial Intelligence (AI) system for automated analysis of images captured from urine samples. Identification and morphological classification of objects in urine sediments by Shrava was compared with the results from Sysmex UF-1000i urine analyzer and manual microscopy. Thirty urine samples were analysed for the study, wherein, on an average, 50 different fields of views were captured at a magnification of 400x from slides…
In this study, we evaluate the performance of Shrava, a cloud based Artificial Intelligence (AI) system for automated analysis of images captured from urine samples. Identification and morphological classification of objects in urine sediments by Shrava was compared with the results from Sysmex UF-1000i urine analyzer and manual microscopy. Thirty urine samples were analysed for the study, wherein, on an average, 50 different fields of views were captured at a magnification of 400x from slides prepared from the samples. Classification of objects from the captured images was verified by three qualified medical experts and sensitivity, specificity, and accuracy of the classification results were calculated. Classification performance of Shrava was evaluated for RBCs, WBCs, crystals, epithelial cells and organisms (yeast and bacteria). The specificity for classification was above 97% for RBCs and above 99% for all other objects, while sensitivity was above 99% for yeast and epithelial cells, above 97% for RBCs, WBCs, and bacteria, and above 87% for crystals. Overall, classification accuracy for all objects was 96.4%. We also evaluated the sensitivity of Shrava for the above mentioned objects vis-a-vis reports obtained through a combination of urine analyser and manual microscopy and it was found to be 96.19%. Shrava was found to be effective in identifying and classifying objects in urine sediments. It saves time by aiding pathologists as a screening solution and also accelerates the turnaround time, thereby, increasing the productivity of pathologists and the laboratory.
-
Effective search space reduction for spell correction using character neural embeddings
European Chapter of the Association for Computational Linguistics Valencia, Spain, 2017
See publicationWe present a novel, unsupervised, and distance measure agnostic method for search space reduction in spell correction using neural character embeddings. The embeddings are learned by skip-gram word2vec training on sequences generated from dictionary words in a phonetic informationretentive manner. We report a very high performance in terms of both success rates and reduction of search space on the Birkbeck spelling error corpus.
-
Structural Studies of Sacsin
Acta Crystallographica Section A Foundations of Crystallography 68(a1):s163-s163
See publicationSacsin is a 520 kDa protein involved in an early onset neurodegenerative disease Autosomal Recessive Spastic Ataxia of Charlevoix-Saguenay (ARSACS) prevalent in the Charlevoix-Saguenay-Lac-Saint-Jean region of Quebec. Structural information about sacsin should help in clarifying the functional role of the protein and its involvement in neurodegeneration. The very C-terminus of the protein contains a Higher Eukaryotes and Prokaryotes Nucleotide-binding (HEPN) domain of unknown function. We…
Sacsin is a 520 kDa protein involved in an early onset neurodegenerative disease Autosomal Recessive Spastic Ataxia of Charlevoix-Saguenay (ARSACS) prevalent in the Charlevoix-Saguenay-Lac-Saint-Jean region of Quebec. Structural information about sacsin should help in clarifying the functional role of the protein and its involvement in neurodegeneration. The very C-terminus of the protein contains a Higher Eukaryotes and Prokaryotes Nucleotide-binding (HEPN) domain of unknown function. We determined a high-resolution 1.9 Å crystal structure of the HEPN domain from human sacsin [1]. The structure is made of five helices in an antiparallel arrangement with a long loop between helices 4 and 5 containing several short structured segments. Importantly, the HEPN domain forms a stable dimer in the crystal with a large buried surface formed by helices 1 and 2 and the 4- 5 loop. Multi-angle light scattering and NMR self-diffusion experiments confirmed that the sacsin HEPN domain also forms a dimer in solution. The structure explains why the N4549D mutation causes disease in some ARSACS patients. The mutation leads to electrostatic repulsion near dimerization interface, destabilizing the dimer and resulting in insoluble protein. The HEPN structure contains a large positively charged cavity at the dimer interface that is optimized for binding negatively charged ligands. Isothermal titration calorimetry and NMR titrations showed that this surface binds nucleotides with low micromolar affinity, though the identity of physiological ligand is currently unclear. Recently, we crystallized and solved the structure of the N-terminal ubiquitin-like (UBL) domain from human sacsin at 2.1 Å resolution. Unexpectedly, the structure shows a swapped dimer with the swapping mediated by exchange of first -strands of each protomer. The study provides important steps towards better understanding of structure/function relationships of sacsin and its involvement in neurodegeneration.
-
The unusual structure of the ubiquitin-like domain of the protein sacsin
McGill University Libraries
See publicationAutosomal recessive spastic ataxia of Charlevoix-Saguenay (ARSACS) is a progressive neurodegenerative disorder that first presents in early childhood, with prominent symptoms of spasticity in limbs, gait ataxia, slower motor development, muscle wasting and slurred speech. The disease is due to mutations in the SACS gene, which codes for sacsin, a large multidomain protein found in neurons. Loss of sacsin function is associated with progressive loss of cerebellar Purkinje cells and hyperfused…
Autosomal recessive spastic ataxia of Charlevoix-Saguenay (ARSACS) is a progressive neurodegenerative disorder that first presents in early childhood, with prominent symptoms of spasticity in limbs, gait ataxia, slower motor development, muscle wasting and slurred speech. The disease is due to mutations in the SACS gene, which codes for sacsin, a large multidomain protein found in neurons. Loss of sacsin function is associated with progressive loss of cerebellar Purkinje cells and hyperfused mitochondrial network, most likely due to loss of the mitochondrial fission efficiency. Most of sacsin is structurally and functionally uncharacterized. Sacsin is believed to play a role as a chaperone for promoting the folding of ataxia-related proteins. Bioinformatics analysis showed that sacsin contains an integral ubiquitin-like domain (UBL) domain, which was subsequently shown to weakly interact with the proteasomal subunit C-8 in the coimmunoprecipitation studies. The research work described in this thesis includes the incorporation of selenomethionine in the UBL sequence, crystallization of the selenomethionine-labeled UBL domain to produce well-diffracting crystals, and determination of the structure of the UBL domain by using the anomalous scattering signal from selenium. The UBL structure obtained is unusual as it is a swapped dimer formed by the exchange of the N-terminal portions of two molecules. The existence of dimer was confirmed in solution by PFG-NMR self-diffusion experiments. The hydrophobic patch that is usually responsible for interaction of the UBL domain with other proteins is occluded in the swapped dimer, which suggests that the sacsin UBL domain does not bind the proteasome as a dimer.
Patents
-
Methods and systems for predicting health of products in a manufacturing process
Filed US 16251924
See patentThe embodiments herein disclose methods and systems for predicting health of products in a manufacturing process. A method includes determining at least one of a dynamic data and a static data of at least one product from a manufacturing process steps. Further, the method includes determining and filtering at least one of a gradual change, an abrupt change and a similar data present in the at least one of the dynamic data and the static data. Further, the method includes converting the filtered…
The embodiments herein disclose methods and systems for predicting health of products in a manufacturing process. A method includes determining at least one of a dynamic data and a static data of at least one product from a manufacturing process steps. Further, the method includes determining and filtering at least one of a gradual change, an abrupt change and a similar data present in the at least one of the dynamic data and the static data. Further, the method includes converting the filtered at least one of the dynamic data and the static data of the at least one product into a common data format. The common data format can be stored in a common hyperspace. Further, the method includes predicting a health of the at least one product based on the common data format and the at least one product historical health information received from an apriori computer.
-
Method and device for integrating image channels in a deep learning model for classification
IN WO2019073312A1
See patentEmbodiments of present disclosure disclose method and device for integrating image channels in a deep learning model for classification of objects in a sample. Initially, at least one microscopic image of a sample comprising plurality of objects is received. Plurality of image channels is generated for the at least one microscopic image using at least one image operator. The plurality of image channels comprises at least one colour image channel and at least one hand-crafted image channel. Upon…
Embodiments of present disclosure disclose method and device for integrating image channels in a deep learning model for classification of objects in a sample. Initially, at least one microscopic image of a sample comprising plurality of objects is received. Plurality of image channels is generated for the at least one microscopic image using at least one image operator. The plurality of image channels comprises at least one colour image channel and at least one hand-crafted image channel. Upon the generation, the plurality of image channels is provided to a deep learning model to integrate the plurality of image channels with the deep learning model for classification of the plurality of objects.
-
Method and system for reconstructing a field of view
IN WO2019102272A1
See patentEmbodiments of present disclosure discloses system and method for reconstruction of FOV. Initially, presence of one of single object and distinct objects in FOV of image of sample comprising one or more objects is determined based on sharpness of one or more objects. A single optimal representation of FOV may be generated when presence of single object is determined. At least one of single optimal representation and a depth-based enhanced representation of FOV may be generated when presence of…
Embodiments of present disclosure discloses system and method for reconstruction of FOV. Initially, presence of one of single object and distinct objects in FOV of image of sample comprising one or more objects is determined based on sharpness of one or more objects. A single optimal representation of FOV may be generated when presence of single object is determined. At least one of single optimal representation and a depth-based enhanced representation of FOV may be generated when presence of distinct objects is determined. For generating depth-based enhanced representation, one or more first optimal images associated with each of distinct objects in FOV may be retrieved. An optimal representation of each of distinct objects is generated based on corresponding one or more first optimal images. Further, optimal representation of each of distinct objects is placed at corresponding optimal depth associated with respective distinct object to generate depth-based enhanced representation.
Courses
-
Advanced Machine Learning
CS 726
-
Applied Linear Algebra
EE 635
-
Foundations of Machine Learning
CS 725
-
Natural Language Processing
CS 626
-
Probabilistic Models
IE 502
Projects
-
CrowdAnalytix online competition for extraction of poduct attribtues
See projectGlobal rank: 12/173
Public leader board accuracy : 82.75%
This data science competition was hosted by CorwdAnalytix. Please find the details here:
https://www.crowdanalytix.com/contests/extraction-of-product-attribute-values -
Open source contribution to Deep Learning Library: DeepCTR
-
See projectI contribute to DeepCTR library by writing code for Click-Through Rate prediction models for publicly available research papers.
Languages
-
English
Native or bilingual proficiency
-
French
Limited working proficiency
-
Hindi
Native or bilingual proficiency
Recommendations received
6 people have recommended Harshit
Join now to viewMore activity by Harshit
-
Honoured to represent Paytm Travel at the FE Martech Awards 2025 under the MarTech for Social category! Grateful to Palash Kulshrestha, Shruti…
Honoured to represent Paytm Travel at the FE Martech Awards 2025 under the MarTech for Social category! Grateful to Palash Kulshrestha, Shruti…
Liked by Harshit Pande
-
Lab Farewell & Cycling trip- a break from our usual lab routine
Lab Farewell & Cycling trip- a break from our usual lab routine
Liked by Harshit Pande
-
We just published a paper in #Bioinformatics titled "PyEvoCell: an LLM-Augmented Single Cell Trajectory Analysis Dashboard". This version of the app…
We just published a paper in #Bioinformatics titled "PyEvoCell: an LLM-Augmented Single Cell Trajectory Analysis Dashboard". This version of the app…
Liked by Harshit Pande
-
Just wrapped up a fascinating course on quantum computing algorithms! Gained a deeper understanding of how quantum computers can outperform classical…
Just wrapped up a fascinating course on quantum computing algorithms! Gained a deeper understanding of how quantum computers can outperform classical…
Liked by Harshit Pande
-
Excited to share that this Summer, I'll be joining Netflix as an ML Research Intern
Excited to share that this Summer, I'll be joining Netflix as an ML Research Intern
Liked by Harshit Pande
-
Last week I attended France's AI Action Summit in Paris, where we convened Bloomberg clients and others to talk about the revolutionary potential for…
Last week I attended France's AI Action Summit in Paris, where we convened Bloomberg clients and others to talk about the revolutionary potential for…
Liked by Harshit Pande
-
I am looking for Principal Applied Scientist to come join a stellar team of 60+ engineering, science and product leaders to invent new products that…
I am looking for Principal Applied Scientist to come join a stellar team of 60+ engineering, science and product leaders to invent new products that…
Liked by Harshit Pande
-
I am hiring applied scientists in the GTA area. If you have a Master+ degree in CS or related areas and solid background in ML, NLP, and GenAI, check…
I am hiring applied scientists in the GTA area. If you have a Master+ degree in CS or related areas and solid background in ML, NLP, and GenAI, check…
Liked by Harshit Pande
-
🚀 From One Day to Day One!! 🚀 I’m thrilled to share that I’m joining Meta as a Software Engineer - ML! This marks the beginning of an exciting…
🚀 From One Day to Day One!! 🚀 I’m thrilled to share that I’m joining Meta as a Software Engineer - ML! This marks the beginning of an exciting…
Liked by Harshit Pande
Other similar profiles
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore MoreOthers named Harshit Pande
-
Harshit Pande
Master's of Business Administration student specializing in Business Analytics | NMIMS, Mumbai | Passionate About Leveraging Data for Business Growth
-
Harshit Pande
Lawyer
-
Harshit Pande
Cloud Engineer | Azure | AWS | AI
-
Harshit Pande
Aspiring Software Engineer| Engineering Graduate | Learning DSA | Cloud & UI Explorer | Tech & Science | Passionate Learner
18 others named Harshit Pande are on LinkedIn
See others named Harshit Pande