A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2021; you can also visit the original URL.
The file type is application/pdf
.
Filters
Lux: Always-on Visualization Recommendations for Exploratory Dataframe Workflows
[article]
2021
arXiv
pre-print
Exploratory data science largely happens in computational notebooks with dataframe APIs, such as pandas, that support flexible means to transform, clean, and analyze data. ...
Lux features a high level language for generating visualizations on demand to encourage rapid visual experimentation with data. ...
(in the 10M case,
on Kaggle for Airbnb and Communities. ...
arXiv:2105.00121v2
fatcat:gphjm3jqdzevbc4f6yvfyjod7i
EILEEN: A recommendation system for scientific publications and grants
[article]
2022
arXiv
pre-print
This article describes EILEEN (Exploratory Innovator of LitEraturE Networks), a recommendation system for scientific publications and grants with open source code and datasets. ...
We find that a learning-to-rank with Random Forest achieves an AUC of 0.9, significantly outperforming both baselines. ...
Apache Spark is used with Python to pull the data from the above sources. The raw data obtained from the third-party sources is cleaned, processed using Spark's DataFrame and Machine Learning APIs. ...
arXiv:2110.09663v2
fatcat:bbm2xa2nkzd7hds4l6a7yuqhk4
OntoTouTra: Tourist Traceability Ontology Based on Big Data Analytics
2021
Applied Sciences
A knowledge base provides us with information on the preparation, planning, and implementation or operation stages. ...
Some studies are related to the construction of ontologies in tourism, but none focus on tourist traceability systems. ...
The analysis can be confirmatory or exploratory, depending on the deductive or inductive approach. ...
doi:10.3390/app112211061
fatcat:gzabwc344zfolhoxnoaf7ygxae
Scalability and Maintainability Challenges and Solutions in Machine/Deep Learning: A Systematic Literature Review
2023
Zenodo
Contributions: Our study presents (i) a catalogue of maintainability and scalability challenges and solutions in various stages of Data Engineering and Model Engineering workflows, as well as difficulties ...
Methodology: We conducted a systematic literature review, initially screening over 17,000 papers and subsequently selecting and reviewing 124 papers to be included in this study. ...
In our study, we validate the recommenda ons of these two experts with a case study with seventeen par cipants. ...
doi:10.5281/zenodo.8024833
fatcat:s2elkgq6trbzrepv5pvn6ql444
International Conference on Recent and Future Trends in Smart Electronics System and Manufacturing
[chapter]
2023
International Conference on Recent and Future Trends in Smart Electronics System and Manufacturing
So current study works on the reviews whose ratings are not provided by the customers. ...
Reviews and ratings on the sites will play a vital role in improving global communications among the customers and it has the potential to influence consumer buying patterns as well. ...
The algorithm also outperforms the convergence speed to track the MPP with a response time of less than 0.1 ms. ...
doi:10.13052/rp-9788770229852
fatcat:tuaudbqo2jci3afo37ay4us22q
Model Sketching: Centering Concepts in Early-Stage Machine Learning Model Design
[article]
2023
pre-print
In an evaluation with 17 ML practitioners, model sketching reframed thinking from implementation to higher-level exploration, prompted iteration on a broader range of model designs, and helped identify ...
Could early model development instead focus on high-level questions of which factors a model ought to pay attention to? ...
This work was partially supported by IBM as a founding member of the Stanford Institute for Human-centered Artificial Intelligence (HAI). Michelle S. ...
doi:10.1145/3544548.3581290
arXiv:2303.02884v1
fatcat:p7tjf3r42vdiroquh3rf5qhmxa
Continuous Analysis, Monitoring, and Comparison of Student Project Portfolios in Software Engineering Courses
2023
The primary objective is to prototype a data-driven tool for continuous analysis, monitoring, and comparison of software repositories, with the goal of enhancing the quality of teaching in the field of ...
This thesis suggests the use of software repository data mining in combination with data visualization in a university setting to address the current gap in this area. ...
In addition, I would like to thank the admin of the SEPM class who provided me with the data and regularly gave me information on all organizational and technical issues of the course. ...
doi:10.34726/hss.2023.109702
fatcat:2ck5yn72ubccdjshkkbs576p6q
Process Analysis for Marketing Research
2020
Study 2b is a large-scale survey of moviegoers merged with archival data from a film database. It investigates referral reinforcement when referrals are organic, not incentivized. ...
Study 2b: Referral reinforcement effects among moviegoers. Study 2b is a large survey of moviegoers merged with IMDb data on movie quality ratings and gross earnings. ...
A follow-up analysis of the data in the Appendix of Herda (2013) found a negative correlation between the natural logarithm of the number of constructs in a theory with publication year (r = -.32, p ...
doi:10.26116/center-lis-2009
fatcat:54xdlznhdbfgxkewl3pe5rssva