• Qararyah F, Wahib M, Dikbayır D, Belviranli M and Unat D. (2021). A computational-graph partitioning method for training memory-constrained DNNs. Parallel Computing. 10.1016/j.parco.2021.102792. 104-105. (102792). Online publication date: 1-Jul-2021.

    https://linkinghub.elsevier.com/retrieve/pii/S0167819121000454

  • Faes M and Gross T. (2019). Parallel Roles for Practical Deterministic Parallel Programming. Languages and Compilers for Parallel Computing. 10.1007/978-3-030-35225-7_12. (163-181).

    http://link.springer.com/10.1007/978-3-030-35225-7_12

  • Tillenius M. (2015). SuperGlue: A Shared Memory Framework Using Data Versioning for Dependency-Aware Task-Based Parallelization. SIAM Journal on Scientific Computing. 10.1137/140989716. 37:6. (C617-C642). Online publication date: 1-Jan-2015.

    http://epubs.siam.org/doi/10.1137/140989716

  • Soparkar N, Krzyzanowski P, Jagadish H and Asthana A. Run-time parallelization of sequential database programs. Proceedings of the fourth international conference on Information and knowledge management. (74-81).

    https://doi.org/10.1145/221270.221331

  • Sharma S, Baugh J and Chadha H. (1993). A client—server approach for distributed finite element analysis. Advances in Engineering Software. 10.1016/0965-9978(93)90043-S. 17:2. (69-78). Online publication date: 1-Jan-1993.

    http://linkinghub.elsevier.com/retrieve/pii/096599789390043S

  • Fan X, Liao X and Jin H. (2019). FunctionFlow. Frontiers of Computer Science: Selected Publications from Chinese Universities. 13:1. (73-85). Online publication date: 1-Feb-2019.

    https://doi.org/10.1007/s11704-016-6286-8

  • Faes M and Gross T. (2018). Concurrency-aware object-oriented programming with roles. Proceedings of the ACM on Programming Languages. 2:OOPSLA. (1-30). Online publication date: 24-Oct-2018.

    https://doi.org/10.1145/3276500

  • Yazdanpanah F and Alaei M. (2015). Design space exploration of hardware task superscalar architecture. The Journal of Supercomputing. 71:9. (3567-3592). Online publication date: 1-Sep-2015.

    https://doi.org/10.1007/s11227-015-1449-1

  • Koskinen E, Parkinson M and Herlihy M. Coarse-grained transactions. Proceedings of the 37th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages. (19-30).

    https://doi.org/10.1145/1706299.1706304

  • Koskinen E, Parkinson M and Herlihy M. (2010). Coarse-grained transactions. ACM SIGPLAN Notices. 45:1. (19-30). Online publication date: 2-Jan-2010.

    https://doi.org/10.1145/1707801.1706304

  • Aleen F and Clark N. Commutativity analysis for software parallelization. Proceedings of the 14th international conference on Architectural support for programming languages and operating systems. (241-252).

    https://doi.org/10.1145/1508244.1508273

  • Aleen F and Clark N. (2009). Commutativity analysis for software parallelization. ACM SIGARCH Computer Architecture News. 37:1. (241-252). Online publication date: 1-Mar-2009.

    https://doi.org/10.1145/2528521.1508273

  • Aleen F and Clark N. (2009). Commutativity analysis for software parallelization. ACM SIGPLAN Notices. 44:3. (241-252). Online publication date: 28-Feb-2009.

    https://doi.org/10.1145/1508284.1508273

  • Mandviwala H, Ramachandran U and Knobe K. Capsules. Languages and Compilers for Parallel Computing. (276-291).

    https://doi.org/10.1007/978-3-540-85261-2_19

  • Saad E, Adawy M, Keshk H and Habashy S. (2006). Reconfigurable Parallel Processing System Based on A Modified Ant Colony Algorithm Proceedings of the Twenty Third National Radio Science Conference (NRSC'2006). 10.1109/NRSC.2006.386362. 977-5031-84-2. (1-11).

    http://ieeexplore.ieee.org/document/4275159/

  • Saad E, Adawy M, Keshk H and Habashy S. (2006). Task Graph Generation Proceedings of the Twenty Third National Radio Science Conference (NRSC'2006). 10.1109/NRSC.2006.386361. 977-5031-84-2. (1-9).

    http://ieeexplore.ieee.org/document/4275158/

  • Chan B and Abdelrahman T. (2004). Run-Time Support for the Automatic Parallelization of Java Programs. The Journal of Supercomputing. 28:1. (91-117). Online publication date: 1-Apr-2004.

    https://doi.org/10.1023/B:SUPE.0000014804.20789.21

  • Advea V and Sakellariou R. (2001). Compiler Synthesis of Task Graphs for Parallel Program Performance Prediction. Languages and Compilers for Parallel Computing. 10.1007/3-540-45574-4_14. (208-226).

    http://link.springer.com/10.1007/3-540-45574-4_14

  • Alverson G, Griswold W, Lin C, Notkin D and Snyder L. (1998). Abstractions for Portable, Scalable Parallel Programming. IEEE Transactions on Parallel and Distributed Systems. 9:1. (71-86). Online publication date: 1-Jan-1998.

    https://doi.org/10.1109/71.655246

  • Sandhu H and Sevcik K. An analytic study of dynamic hardware and software cache coherence strategies. Proceedings of the 1995 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems. (167-177).

    https://doi.org/10.1145/223587.223606

  • Sandhu H and Sevcik K. (1995). An analytic study of dynamic hardware and software cache coherence strategies. ACM SIGMETRICS Performance Evaluation Review. 23:1. (167-177). Online publication date: 1-May-1995.

    https://doi.org/10.1145/223586.223606

  • Sato M, Kodama Y, Sakai S and Yamaguchi Y. (1995). EM-C: Programming with Explicit Parallelism and Locality for EM-4 Multiprocessor. Parallel Language and Compiler Research in Japan. 10.1007/978-1-4615-2269-0_3. (47-73).

    http://link.springer.com/10.1007/978-1-4615-2269-0_3

  • Subhlok J, O'Hallaron D, Gross T, Dinda P and Webb J. Communication and memory requirements as the basis for mapping task and data parallel programs. Proceedings of the 1994 ACM/IEEE conference on Supercomputing. (330-339).

    /doi/10.5555/602770.602832

  • Sato M, Kodama Y, Sakai S and Yamaguchi Y. EM-C. Proceedings of the IFIP WG10.3 Working Conference on Parallel Architectures and Compilation Techniques. (3-14).

    /doi/10.5555/647042.713831

  • Jagannathan S and Weeks S. (1994). Analyzing stores and references in a parallel symbolic language. ACM SIGPLAN Lisp Pointers. VII:3. (294-305). Online publication date: 1-Jul-1994.

    https://doi.org/10.1145/182590.182493

  • Jagannathan S and Weeks S. Analyzing stores and references in a parallel symbolic language. Proceedings of the 1994 ACM conference on LISP and functional programming. (294-305).

    https://doi.org/10.1145/182409.182493

  • Halstead B, Callahan D, Dennis J, Nikhil R and Sarkar V. (1994). Programming, compilation, and resource management issues for multithreading (panel session II). ACM SIGARCH Computer Architecture News. 22:1. (19-33). Online publication date: 1-Mar-1994.

    https://doi.org/10.1145/181993.181997

  • Lee J. Concord: re-thinking the division of labor in a distributed shared memory system IEEE Scalable High Performance Computing Conference. 10.1109/SHPCC.1994.296694. 0-8186-5680-8. (585-592).

    http://ieeexplore.ieee.org/document/296694/

  • Alt M, Aßmann U and Someren H. (1994). Cosy compiler phase embedding with the CoSy compiler model. Compiler Construction. 10.1007/3-540-57877-3_19. (278-293).

    http://link.springer.com/10.1007/3-540-57877-3_19

  • Kumar P. Language support for data parallelism in pointer based dynamic data structures. Proceedings of the 1993 conference of the Centre for Advanced Studies on Collaborative research: distributed computing - Volume 2. (933-941).

    /doi/10.5555/962367.962398

  • Sandhu H, Gamsa B and Zhou S. The shared regions approach to software cache coherence on multiprocessors. Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming. (229-238).

    https://doi.org/10.1145/155332.155356

  • Subhlok J, Stichnoth J, O'Hallaron D and Gross T. Exploiting task and data parallelism on a multicomputer. Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming. (13-22).

    https://doi.org/10.1145/155332.155334

  • Sandhu H, Gamsa B and Zhou S. (1993). The shared regions approach to software cache coherence on multiprocessors. ACM SIGPLAN Notices. 28:7. (229-238). Online publication date: 1-Jul-1993.

    https://doi.org/10.1145/173284.155356

  • Subhlok J, Stichnoth J, O'Hallaron D and Gross T. (1993). Exploiting task and data parallelism on a multicomputer. ACM SIGPLAN Notices. 28:7. (13-22). Online publication date: 1-Jul-1993.

    https://doi.org/10.1145/173284.155334

  • Gross T, Hinrichs S, Lueh G, O'Hallaron D, Stichnoch J and Subhlok J. (1993). Compiling task and data parallel programs for iWarp. ACM SIGPLAN Notices. 28:1. (32-35). Online publication date: 1-Jan-1993.

    https://doi.org/10.1145/156668.156682

  • Wei Y, Shi S and Lin D. (1993). Converting monolithic programs for DCE client-server computing given incomplete cutset information. DCE — The OSF Distributed Computing Environment Client/Server Model and Beyond. 10.1007/3-540-57306-2_31. (135-146).

    http://link.springer.com/10.1007/3-540-57306-2_31

  • Rinard M, Scales D and Lam M. Heterogeneous parallel programming in Jade. Proceedings of the 1992 ACM/IEEE conference on Supercomputing. (245-256).

    /doi/10.5555/147877.148003

  • Rinard M and Lam M. Semantic foundations of Jade. Proceedings of the 19th ACM SIGPLAN-SIGACT symposium on Principles of programming languages. (105-118).

    https://doi.org/10.1145/143165.143189

  • Weiren Ding and Hendren L. SELSYN-C: a self-synchronizing parallel programming language ICCI `92: Fourth International Conference on Computing and Information. 10.1109/ICCI.1992.227685. 0-8186-2812-X. (143-146).

    http://ieeexplore.ieee.org/document/227685/

  • Scales D, Rinard M, Lam M and Anderson J. Hierarchical concurrency in Jade. Languages and Compilers for Parallel Computing. 10.1007/BFb0038657. (50-64).

    http://www.springerlink.com/index/10.1007/BFb0038657

  • Gupta A. (1992). Stanford DASH multiprocessor: The hardware and software approach. PARLE '92 Parallel Architectures and Languages Europe. 10.1007/3-540-55599-4_125. (799-805).

    http://link.springer.com/10.1007/3-540-55599-4_125