[41] M. Klettke. Evolution management of multi-model data (position paper).
[42] P. Konda, S. Das, P. Suganthan GC, A. Doan, A. Ardalan, J. R. Ballard,
H. Li, F. Panahi, H. Zhang, J. Naughton, et al. Magellan: Toward
building entity matching management systems. Proceedings of the VLDB
Endowment, 9(12):1197–1208, 2016.
[43] T. Kraska, A. Beutel, E. H. Chi, J. Dean, and N. Polyzotis. The case
for learned index structures. In Proceedings of the 2018 International
Conference on Management of Data, pages 489–504, 2018.
[44] S. Krishnan, Z. Yang, K. Goldberg, J. Hellerstein, and I. Stoica. Learning
to optimize join queries with deep reinforcement learning. arXiv preprint
arXiv:1808.03196, 2018.
[45] A. Kurakin, I. Goodfellow, and S. Bengio. Adversarial machine learning
at scale. arXiv preprint arXiv:1611.01236, 2016.
[46] Y. LeCun, Y. Bengio, and G. Hinton. Deep learning. nature,
521(7553):436–444, 2015.
[47] G. Li, X. Zhou, S. Li, and B. Gao. Qtune: A query-aware database
tuning system with deep reinforcement learning. Proceedings of the
VLDB Endowment, 12(12):2118–2130, 2019.
[48] X.-L. Mao, B.-S. Feng, Y.-J. Hao, L. Nie, H. Huang, and G. Wen. S2jsd-
lsh: A locality-sensitive hashing schema for probability distributions. In
Proceedings of the Thirty-First AAAI Conference on Artificial Intelli-
gence, pages 3244–3251, 2017.
[49] V. V. Meduri, L. Popa, P. Sen, and M. Sarwat. A comprehensive
benchmark framework for active learning methods in entity matching.
In Proceedings of the 2020 ACM SIGMOD International Conference on
Management of Data, pages 1133–1147, 2020.
[50] R. J. Miller. Open data integration. Proceedings of the VLDB
Endowment, 11(12):2130–2139, 2018.
[51] R. J. Miller, L. M. Haas, and M. A. Hern
´
andez. Schema mapping as
query discovery. In VLDB, volume 2000, pages 77–88, 2000.
[52] M. J. Mior, K. Salem, A. Aboulnaga, and R. Liu. Nose: Schema design
for nosql applications. IEEE Transactions on Knowledge and Data
Engineering, 29(10):2275–2289, 2017.
[53] M. L. M
¨
oller, M. Klettke, A. Hillenbrand, and U. St
¨
orl. Query rewriting
for continuously evolving nosql databases. In International Conference
on Conceptual Modeling, pages 213–221. Springer, 2019.
[54] H. J. Moon, C. A. Curino, A. Deutsch, C.-Y. Hou, and C. Zaniolo. Man-
aging and querying transaction-time databases under schema evolution.
Proceedings of the VLDB Endowment, 1(1):882–895, 2008.
[55] H. J. Moon, C. A. Curino, M. Ham, and C. Zaniolo. Prima: archiving
and querying historical data with evolving schemas. In Proceedings of
the 2009 ACM SIGMOD International Conference on Management of
data, pages 1019–1022, 2009.
[56] H. J. Moon, C. A. Curino, and C. Zaniolo. Scalable architecture and
query optimization fortransaction-time dbs with evolving schemas. In
Proceedings of the 2010 ACM SIGMOD International Conference on
Management of data, pages 207–218, 2010.
[57] S. Mudgal, H. Li, T. Rekatsinas, A. Doan, Y. Park, G. Krishnan, R. Deep,
E. Arcaute, and V. Raghavendra. Deep learning for entity matching:
A design space exploration. In Proceedings of the 2018 International
Conference on Management of Data, pages 19–34, 2018.
[58] F. Nargesian, K. Q. Pu, E. Zhu, B. Ghadiri Bashardoost, and R. J.
Miller. Organizing data lakes for navigation. In Proceedings of the
2020 ACM SIGMOD International Conference on Management of Data,
pages 1939–1950, 2020.
[59] F. Nargesian, E. Zhu, K. Q. Pu, and R. J. Miller. Table union search on
open data. Proceedings of the VLDB Endowment, 11(7):813–825, 2018.
[60] S. Palkar, J. J. Thomas, A. Shanbhag, D. Narayanan, H. Pirk,
M. Schwarzkopf, S. Amarasinghe, M. Zaharia, and S. InfoLab. Weld:
A common runtime for high performance data analytics. In Conference
on Innovative Data Systems Research (CIDR), page 45, 2017.
[61] L. Popa, M. A. Hernandez, Y. Velegrakis, R. J. Miller, F. Naumann, and
H. Ho. Mapping xml and relational schemas with clio. In Proceedings
18th International Conference on Data Engineering, pages 498–499.
IEEE, 2002.
[62] A. Radford, J. Wu, R. Child, D. Luan, D. Amodei, and I. Sutskever.
Language models are unsupervised multitask learners. OpenAI Blog,
1(8):9, 2019.
[63] E. Rahm and P. A. Bernstein. An online bibliography on schema
evolution. ACM Sigmod Record, 35(4):30–31, 2006.
[64] S. Scherzinger, M. Klettke, and U. St
¨
orl. Managing schema evolution
in nosql data stores. arXiv preprint arXiv:1308.0514, 2013.
[65] J. Schmidhuber. Deep learning in neural networks: An overview. Neural
networks, 61:85–117, 2015.
[66] Y. Shen, K. Chakrabarti, S. Chaudhuri, B. Ding, and L. Novik. Dis-
covering queries based on example tuples. In Proceedings of the 2014
ACM SIGMOD international conference on Management of data, pages
493–504, 2014.
[67] Y. Sheng. Non-blocking Lazy Schema Changes in Multi-Version
Database Management Systems. PhD thesis, Carnegie Mellon University
Pittsburgh, PA, 2019.
[68] A. Shrivastava, T. Pfister, O. Tuzel, J. Susskind, W. Wang, and R. Webb.
Learning from simulated and unsupervised images through adversarial
training. In Proceedings of the IEEE conference on computer vision and
pattern recognition, pages 2107–2116, 2017.
[69] M. Stonebraker and I. F. Ilyas. Data integration: The current status and
the way forward. IEEE Data Eng. Bull., 41(2):3–9, 2018.
[70] U. St
¨
orl, M. Klettke, and S. Scherzinger. Nosql schema evolution and
data migration: State-of-the-art and opportunities. In EDBT, pages 655–
658, 2020.
[71] D. G. Sullivan, M. I. Seltzer, and A. Pfeffer. Using probabilistic
reasoning to automate software tuning. ACM SIGMETRICS Performance
Evaluation Review, 32(1):404–405, 2004.
[72] B. Ten Cate, P. G. Kolaitis, and W.-C. Tan. Schema mappings and
data examples. In Proceedings of the 16th International Conference on
Extending Database Technology, pages 777–780, 2013.
[73] S. Thirumuruganathan, S. A. P. Parambath, M. Ouzzani, N. Tang, and
S. Joty. Reuse and adaptation for entity resolution through transfer
learning. arXiv preprint arXiv:1809.11084, 2018.
[74] S. Thirumuruganathan, N. Tang, M. Ouzzani, and A. Doan. Data
curation with deep learning [vision]. arXiv preprint arXiv:1803.01384,
2018.
[75] A. Tsymbal. The problem of concept drift: definitions and related work.
Computer Science Department, Trinity College Dublin, 106(2):58, 2004.
[76] D. Van Aken, A. Pavlo, G. J. Gordon, and B. Zhang. Automatic database
management system tuning through large-scale machine learning. In
Proceedings of the 2017 ACM International Conference on Management
of Data, pages 1009–1024, 2017.
[77] Y. Velegrakis, R. J. Miller, and L. Popa. Preserving mapping consistency
under schema changes. The VLDB Journal, 13(3):274–293, 2004.
[78] L. Wang, S. Zhang, J. Shi, L. Jiao, O. Hassanzadeh, J. Zou, and
C. Wangz. Schema management for document stores. Proceedings of
the VLDB Endowment, 8(9):922–933, 2015.
[79] C. Xiao, W. Wang, and X. Lin. Ed-join: an efficient algorithm for
similarity joins with edit distance constraints. Proceedings of the VLDB
Endowment, 1(1):933–944, 2008.
[80] C. Yu and L. Popa. Semantic adaptation of schema mappings when
schemas evolve. In Proceedings of the 31st international conference on
Very large data bases, pages 1006–1017. VLDB Endowment, 2005.
[81] J. Zhang, Y. Liu, K. Zhou, G. Li, Z. Xiao, B. Cheng, J. Xing, Y. Wang,
T. Cheng, L. Liu, et al. An end-to-end automatic cloud database tuning
system using deep reinforcement learning. In Proceedings of the 2019
International Conference on Management of Data, pages 415–432, 2019.
[82] C. Zhao and Y. He. Auto-em: End-to-end fuzzy entity-matching using
pre-trained deep models and transfer learning. In The World Wide Web
Conference, pages 2413–2424, 2019.
[83] E. Zhu, D. Deng, F. Nargesian, and R. J. Miller. Josie: Overlap set
similarity search for finding joinable tables in data lakes. In Proceedings
of the 2019 International Conference on Management of Data, pages
847–864, 2019.
[84] E. Zhu, Y. He, and S. Chaudhuri. Auto-join: Joining tables by leveraging
transformations. Proceedings of the VLDB Endowment, 10(10):1034–
1045, 2017.
[85] E. Zhu, F. Nargesian, K. Q. Pu, and R. J. Miller. Lsh ensemble: Internet-
scale domain search. arXiv preprint arXiv:1603.07410, 2016.
[86] J. Zou, P. Barhate, A. Das, A. Iyengar, B. Yuan, D. Jankov, and
C. Jermaine. Lachesis: Automated generation of persistent partitionings
for big data applications. arXiv preprint arXiv:2006.16529, 2020.