Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Filters








8 Hits in 3.5 sec

Lux: Always-on Visualization Recommendations for Exploratory Dataframe Workflows [article]

Doris Jung-Lin Lee, Dixin Tang, Kunal Agarwal, Thyne Boonmark, Caitlyn Chen, Jake Kang, Ujjaini Mukhopadhyay, Jerry Song, Micah Yong, Marti A. Hearst, Aditya G. Parameswaran
2021 arXiv   pre-print
Exploratory data science largely happens in computational notebooks with dataframe APIs, such as pandas, that support flexible means to transform, clean, and analyze data.  ...  Lux features a high level language for generating visualizations on demand to encourage rapid visual experimentation with data.  ...  (in the 10M case, on Kaggle for Airbnb and Communities.  ... 
arXiv:2105.00121v2 fatcat:gphjm3jqdzevbc4f6yvfyjod7i

EILEEN: A recommendation system for scientific publications and grants [article]

Daniel E. Acuna, Kartik Nagre, Priya Matnani
2022 arXiv   pre-print
This article describes EILEEN (Exploratory Innovator of LitEraturE Networks), a recommendation system for scientific publications and grants with open source code and datasets.  ...  We find that a learning-to-rank with Random Forest achieves an AUC of 0.9, significantly outperforming both baselines.  ...  Apache Spark is used with Python to pull the data from the above sources. The raw data obtained from the third-party sources is cleaned, processed using Spark's DataFrame and Machine Learning APIs.  ... 
arXiv:2110.09663v2 fatcat:bbm2xa2nkzd7hds4l6a7yuqhk4

OntoTouTra: Tourist Traceability Ontology Based on Big Data Analytics

Juan Francisco Mendoza-Moreno, Luz Santamaria-Granados, Anabel Fraga Vázquez, Gustavo Ramirez-Gonzalez
2021 Applied Sciences  
A knowledge base provides us with information on the preparation, planning, and implementation or operation stages.  ...  Some studies are related to the construction of ontologies in tourism, but none focus on tourist traceability systems.  ...  The analysis can be confirmatory or exploratory, depending on the deductive or inductive approach.  ... 
doi:10.3390/app112211061 fatcat:gzabwc344zfolhoxnoaf7ygxae

Scalability and Maintainability Challenges and Solutions in Machine/Deep Learning: A Systematic Literature Review

Karthik Shivashankar, Ghadi S. Al Hajj, Antonio Martini
2023 Zenodo  
Contributions: Our study presents (i) a catalogue of maintainability and scalability challenges and solutions in various stages of Data Engineering and Model Engineering workflows, as well as difficulties  ...  Methodology: We conducted a systematic literature review, initially screening over 17,000 papers and subsequently selecting and reviewing 124 papers to be included in this study.  ...  In our study, we validate the recommenda ons of these two experts with a case study with seventeen par cipants.  ... 
doi:10.5281/zenodo.8024833 fatcat:s2elkgq6trbzrepv5pvn6ql444

International Conference on Recent and Future Trends in Smart Electronics System and Manufacturing [chapter]

Ketan Kotecha, Pritesh Shah, Ujwala Kshirsagar, Durgesh Nandan, M.V.V. Prasad Kantipudi
2023 International Conference on Recent and Future Trends in Smart Electronics System and Manufacturing  
So current study works on the reviews whose ratings are not provided by the customers.  ...  Reviews and ratings on the sites will play a vital role in improving global communications among the customers and it has the potential to influence consumer buying patterns as well.  ...  The algorithm also outperforms the convergence speed to track the MPP with a response time of less than 0.1 ms.  ... 
doi:10.13052/rp-9788770229852 fatcat:tuaudbqo2jci3afo37ay4us22q

Model Sketching: Centering Concepts in Early-Stage Machine Learning Model Design [article]

Michelle S. Lam, Zixian Ma, Anne Li, Izequiel Freitas, Dakuo Wang, James A. Landay, Michael S. Bernstein
2023 pre-print
In an evaluation with 17 ML practitioners, model sketching reframed thinking from implementation to higher-level exploration, prompted iteration on a broader range of model designs, and helped identify  ...  Could early model development instead focus on high-level questions of which factors a model ought to pay attention to?  ...  This work was partially supported by IBM as a founding member of the Stanford Institute for Human-centered Artificial Intelligence (HAI). Michelle S.  ... 
doi:10.1145/3544548.3581290 arXiv:2303.02884v1 fatcat:p7tjf3r42vdiroquh3rf5qhmxa

Continuous Analysis, Monitoring, and Comparison of Student Project Portfolios in Software Engineering Courses

Manuel Stöger, Thomas Grechenig
2023
The primary objective is to prototype a data-driven tool for continuous analysis, monitoring, and comparison of software repositories, with the goal of enhancing the quality of teaching in the field of  ...  This thesis suggests the use of software repository data mining in combination with data visualization in a university setting to address the current gap in this area.  ...  In addition, I would like to thank the admin of the SEPM class who provided me with the data and regularly gave me information on all organizational and technical issues of the course.  ... 
doi:10.34726/hss.2023.109702 fatcat:2ck5yn72ubccdjshkkbs576p6q

Process Analysis for Marketing Research

Constant Pieters
2020
Study 2b is a large-scale survey of moviegoers merged with archival data from a film database. It investigates referral reinforcement when referrals are organic, not incentivized.  ...  Study 2b: Referral reinforcement effects among moviegoers. Study 2b is a large survey of moviegoers merged with IMDb data on movie quality ratings and gross earnings.  ...  A follow-up analysis of the data in the Appendix of Herda (2013) found a negative correlation between the natural logarithm of the number of constructs in a theory with publication year (r = -.32, p  ... 
doi:10.26116/center-lis-2009 fatcat:54xdlznhdbfgxkewl3pe5rssva