<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://strike.scec.org/scecwiki/index.php?action=history&amp;feed=atom&amp;title=AWP_ODC_GPU</id>
	<title>AWP ODC GPU - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://strike.scec.org/scecwiki/index.php?action=history&amp;feed=atom&amp;title=AWP_ODC_GPU"/>
	<link rel="alternate" type="text/html" href="https://strike.scec.org/scecwiki/index.php?title=AWP_ODC_GPU&amp;action=history"/>
	<updated>2026-05-09T12:05:26Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.34.2</generator>
	<entry>
		<id>https://strike.scec.org/scecwiki/index.php?title=AWP_ODC_GPU&amp;diff=9530&amp;oldid=prev</id>
		<title>Maechlin at 18:40, 13 February 2014</title>
		<link rel="alternate" type="text/html" href="https://strike.scec.org/scecwiki/index.php?title=AWP_ODC_GPU&amp;diff=9530&amp;oldid=prev"/>
		<updated>2014-02-13T18:40:16Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;Revision as of 18:40, 13 February 2014&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l1&quot; &gt;Line 1:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 1:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;−&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;del class=&quot;diffchange diffchange-inline&quot;&gt;Efecan Poyraz, a research intern working with &lt;/del&gt;Yifeng Cui, Kim Olsen, and others on SCEC projects &lt;del class=&quot;diffchange diffchange-inline&quot;&gt;on &lt;/del&gt;SCEC's first major GPU &lt;del class=&quot;diffchange diffchange-inline&quot;&gt;simulation in preparation&lt;/del&gt;. &lt;del class=&quot;diffchange diffchange-inline&quot;&gt;Efecan &lt;/del&gt;is &lt;del class=&quot;diffchange diffchange-inline&quot;&gt;trying to run &lt;/del&gt;a full scale Chino Hills validation ground motion simulation, at size of 3500x2500x1500 problem size, 16m resolution, we used 250 GPUs on Keeneland, the initial test went well. We are seeing 55x speedup comparing to the CPU code for the same setting, we know Keeneland may have some scaling issue when using high number of nodes, plus this simulation has serious I/O - around 0.5TB, requiring frequent data transfer from GPU to CPU through PCIe. However, we are pretty happy with the performance getting so far. One issue with current GPU code is that we don't have validation prints about velocity max/min value, for that the simulation results couldn't be validated at run time. We are working with our visualization expert to analyze the initial results, and also work with seismologist Kim Olsen. 
Once everything is set, we'll make reservation next week or so for a full size run which possibly take 5 hours to complete.&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;+&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Yifeng Cui, Kim Olsen&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;, Efecan Poyraz&lt;/ins&gt;, and others on SCEC projects &lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;have developed &lt;/ins&gt;SCEC's first major GPU &lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;software&lt;/ins&gt;. &lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;This group &lt;/ins&gt;is &lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;running &lt;/ins&gt;a full scale Chino Hills validation ground motion simulation, at size of 3500x2500x1500 problem size, 16m resolution, we used 250 GPUs on Keeneland, the initial test went well. We are seeing 55x speedup comparing to the CPU code for the same setting, we know Keeneland may have some scaling issue when using high number of nodes, plus this simulation has serious I/O - around 0.5TB, requiring frequent data transfer from GPU to CPU through PCIe. However, we are pretty happy with the performance getting so far. One issue with current GPU code is that we don't have validation prints about velocity max/min value, for that the simulation results couldn't be validated at run time. We are working with our visualization expert to analyze the initial results, and also work with seismologist Kim Olsen. Once everything is set, we'll make reservation next week or so for a full size run which possibly take 5 hours to complete.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;[[Image:Screen Shot 2012-11-20 at 9.57.51 AM.png|right|150px]]&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;[[Image:Screen Shot 2012-11-20 at 9.57.51 AM.png|right|150px]]&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Maechlin</name></author>
		
	</entry>
	<entry>
		<id>https://strike.scec.org/scecwiki/index.php?title=AWP_ODC_GPU&amp;diff=6189&amp;oldid=prev</id>
		<title>Maechlin at 18:01, 20 November 2012</title>
		<link rel="alternate" type="text/html" href="https://strike.scec.org/scecwiki/index.php?title=AWP_ODC_GPU&amp;diff=6189&amp;oldid=prev"/>
		<updated>2012-11-20T18:01:12Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;Revision as of 18:01, 20 November 2012&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l1&quot; &gt;Line 1:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 1:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Efecan Poyraz, a research intern working with Yifeng Cui, Kim Olsen, and others on SCEC projects on SCEC's first major GPU simulation in preparation. Efecan is trying to run a full scale Chino Hills validation ground motion simulation, at size of 3500x2500x1500 problem size, 16m resolution, we used 250 GPUs on Keeneland, the initial test went well. We are seeing 55x speedup comparing to the CPU code for the same setting, we know Keeneland may have some scaling issue when using high number of nodes, plus this simulation has serious I/O - around 0.5TB, requiring frequent data transfer from GPU to CPU through PCIe. However, we are pretty happy with the performance getting so far. One issue with current GPU code is that we don't have validation prints about velocity max/min value, for that the simulation results couldn't be validated at run time. We are working with our visualization expert to analyze the initial results, and also work with seismologist Kim Olsen. Once everything is set, we'll make reservation next week or so for a full size run which possibly take 5 hours to complete.&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Efecan Poyraz, a research intern working with Yifeng Cui, Kim Olsen, and others on SCEC projects on SCEC's first major GPU simulation in preparation. 
Efecan is trying to run a full scale Chino Hills validation ground motion simulation, at size of 3500x2500x1500 problem size, 16m resolution, we used 250 GPUs on Keeneland, the initial test went well. We are seeing 55x speedup comparing to the CPU code for the same setting, we know Keeneland may have some scaling issue when using high number of nodes, plus this simulation has serious I/O - around 0.5TB, requiring frequent data transfer from GPU to CPU through PCIe. However, we are pretty happy with the performance getting so far. One issue with current GPU code is that we don't have validation prints about velocity max/min value, for that the simulation results couldn't be validated at run time. We are working with our visualization expert to analyze the initial results, and also work with seismologist Kim Olsen. Once everything is set, we'll make reservation next week or so for a full size run which possibly take 5 hours to complete.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;−&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;[[Image:&lt;del class=&quot;diffchange diffchange-inline&quot;&gt;usgs&lt;/del&gt;-&lt;del class=&quot;diffchange diffchange-inline&quot;&gt;logo&lt;/del&gt;-&lt;del class=&quot;diffchange diffchange-inline&quot;&gt;color&lt;/del&gt;.&lt;del class=&quot;diffchange diffchange-inline&quot;&gt;jpg&lt;/del&gt;|right|150px]]&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;+&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;[[Image:&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;Screen Shot 2012&lt;/ins&gt;-&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;11&lt;/ins&gt;-&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;20 at 9&lt;/ins&gt;.&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;57.51 AM.png&lt;/ins&gt;|right|150px]]&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Maechlin</name></author>
		
	</entry>
	<entry>
		<id>https://strike.scec.org/scecwiki/index.php?title=AWP_ODC_GPU&amp;diff=6188&amp;oldid=prev</id>
		<title>Maechlin at 18:00, 20 November 2012</title>
		<link rel="alternate" type="text/html" href="https://strike.scec.org/scecwiki/index.php?title=AWP_ODC_GPU&amp;diff=6188&amp;oldid=prev"/>
		<updated>2012-11-20T18:00:34Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;Revision as of 18:00, 20 November 2012&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l1&quot; &gt;Line 1:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 1:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Efecan Poyraz, a research intern working with Yifeng Cui, Kim Olsen, and others on SCEC projects on SCEC's first major GPU simulation in preparation. Efecan is trying to run a full scale Chino Hills validation ground motion simulation, at size of 3500x2500x1500 problem size, 16m resolution, we used 250 GPUs on Keeneland, the initial test went well. We are seeing 55x speedup comparing to the CPU code for the same setting, we know Keeneland may have some scaling issue when using high number of nodes, plus this simulation has serious I/O - around 0.5TB, requiring frequent data transfer from GPU to CPU through PCIe. However, we are pretty happy with the performance getting so far. One issue with current GPU code is that we don't have validation prints about velocity max/min value, for that the simulation results couldn't be validated at run time. We are working with our visualization expert to analyze the initial results, and also work with seismologist Kim Olsen. Once everything is set, we'll make reservation next week or so for a full size run which possibly take 5 hours to complete.&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Efecan Poyraz, a research intern working with Yifeng Cui, Kim Olsen, and others on SCEC projects on SCEC's first major GPU simulation in preparation. 
Efecan is trying to run a full scale Chino Hills validation ground motion simulation, at size of 3500x2500x1500 problem size, 16m resolution, we used 250 GPUs on Keeneland, the initial test went well. We are seeing 55x speedup comparing to the CPU code for the same setting, we know Keeneland may have some scaling issue when using high number of nodes, plus this simulation has serious I/O - around 0.5TB, requiring frequent data transfer from GPU to CPU through PCIe. However, we are pretty happy with the performance getting so far. One issue with current GPU code is that we don't have validation prints about velocity max/min value, for that the simulation results couldn't be validated at run time. We are working with our visualization expert to analyze the initial results, and also work with seismologist Kim Olsen. Once everything is set, we'll make reservation next week or so for a full size run which possibly take 5 hours to complete.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt; &lt;/td&gt;&lt;td class='diff-marker'&gt;+&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;[[Image:usgs-logo-color.jpg|right|150px]]&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Maechlin</name></author>
		
	</entry>
	<entry>
		<id>https://strike.scec.org/scecwiki/index.php?title=AWP_ODC_GPU&amp;diff=6129&amp;oldid=prev</id>
		<title>Maechlin at 01:58, 3 November 2012</title>
		<link rel="alternate" type="text/html" href="https://strike.scec.org/scecwiki/index.php?title=AWP_ODC_GPU&amp;diff=6129&amp;oldid=prev"/>
		<updated>2012-11-03T01:58:12Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;Revision as of 01:58, 3 November 2012&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l1&quot; &gt;Line 1:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 1:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;−&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Efecan Poyraz &lt;del class=&quot;diffchange diffchange-inline&quot;&gt;is &lt;/del&gt;a research intern working with Yifeng Cui, Kim Olsen, and others on SCEC projects on SCEC's first major GPU simulation in preparation. Efecan is trying to run a full scale Chino Hills validation ground motion simulation, at size of 3500x2500x1500 problem size, 16m resolution, we used 250 GPUs on Keeneland, the initial test went well. We are seeing 55x speedup comparing to the CPU code for the same setting, we know Keeneland may have some scaling issue when using high number of nodes, plus this simulation has serious I/O - around 0.5TB, requiring frequent data transfer from GPU to CPU through PCIe. However, we are pretty happy with the performance getting so far. One issue with current GPU code is that we don't have validation prints about velocity max/min value, for that the simulation results couldn't be validated at run time. We are working with our visualization expert to analyze the initial results, and also work with seismologist Kim Olsen. Once everything is set, we'll make reservation next week or so for a full size run which possibly take 5 hours to complete.&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;+&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Efecan Poyraz&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;, &lt;/ins&gt;a research intern working with Yifeng Cui, Kim Olsen, and others on SCEC projects on SCEC's first major GPU simulation in preparation. 
Efecan is trying to run a full scale Chino Hills validation ground motion simulation, at size of 3500x2500x1500 problem size, 16m resolution, we used 250 GPUs on Keeneland, the initial test went well. We are seeing 55x speedup comparing to the CPU code for the same setting, we know Keeneland may have some scaling issue when using high number of nodes, plus this simulation has serious I/O - around 0.5TB, requiring frequent data transfer from GPU to CPU through PCIe. However, we are pretty happy with the performance getting so far. One issue with current GPU code is that we don't have validation prints about velocity max/min value, for that the simulation results couldn't be validated at run time. We are working with our visualization expert to analyze the initial results, and also work with seismologist Kim Olsen. Once everything is set, we'll make reservation next week or so for a full size run which possibly take 5 hours to complete.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Maechlin</name></author>
		
	</entry>
	<entry>
		<id>https://strike.scec.org/scecwiki/index.php?title=AWP_ODC_GPU&amp;diff=6122&amp;oldid=prev</id>
		<title>Maechlin at 15:46, 25 October 2012</title>
		<link rel="alternate" type="text/html" href="https://strike.scec.org/scecwiki/index.php?title=AWP_ODC_GPU&amp;diff=6122&amp;oldid=prev"/>
		<updated>2012-10-25T15:46:00Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;Revision as of 15:46, 25 October 2012&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l1&quot; &gt;Line 1:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 1:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;−&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Efecan is a research intern working with &lt;del class=&quot;diffchange diffchange-inline&quot;&gt;me &lt;/del&gt;on SCEC projects&lt;del class=&quot;diffchange diffchange-inline&quot;&gt;. Thanks for the support to him as well as to our &lt;/del&gt;first major GPU simulation in preparation. &lt;del class=&quot;diffchange diffchange-inline&quot;&gt;What &lt;/del&gt;Efecan is trying to run &lt;del class=&quot;diffchange diffchange-inline&quot;&gt;is &lt;/del&gt;a full scale Chino Hills validation ground motion simulation, at size of 3500x2500x1500 problem size, 16m resolution, we used 250 GPUs on Keeneland, the initial test went well. We are seeing 55x speedup comparing to the CPU code for the same setting, we know Keeneland may have some scaling issue when using high number of nodes, plus this simulation has serious I/O - around 0.5TB, requiring frequent data transfer from GPU to CPU through PCIe. However, we are pretty happy with the performance getting so far. One issue with current GPU code is that we don't have validation prints about velocity max/min value, for that the simulation results couldn't be validated at run time. We are working with our visualization expert to analyze the initial results, and also work with seismologist Kim Olsen. 
Once everything is set, we'll make reservation next week or so for a full size run which possibly take 5 hours to complete.&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;+&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Efecan &lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;Poyraz &lt;/ins&gt;is a research intern working with &lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;Yifeng Cui, Kim Olsen, and others &lt;/ins&gt;on SCEC projects &lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;on SCEC's &lt;/ins&gt;first major GPU simulation in preparation. Efecan is trying to run a full scale Chino Hills validation ground motion simulation, at size of 3500x2500x1500 problem size, 16m resolution, we used 250 GPUs on Keeneland, the initial test went well. We are seeing 55x speedup comparing to the CPU code for the same setting, we know Keeneland may have some scaling issue when using high number of nodes, plus this simulation has serious I/O - around 0.5TB, requiring frequent data transfer from GPU to CPU through PCIe. However, we are pretty happy with the performance getting so far. One issue with current GPU code is that we don't have validation prints about velocity max/min value, for that the simulation results couldn't be validated at run time. We are working with our visualization expert to analyze the initial results, and also work with seismologist Kim Olsen. Once everything is set, we'll make reservation next week or so for a full size run which possibly take 5 hours to complete.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Maechlin</name></author>
		
	</entry>
	<entry>
		<id>https://strike.scec.org/scecwiki/index.php?title=AWP_ODC_GPU&amp;diff=6121&amp;oldid=prev</id>
		<title>Maechlin: Created page with 'Efecan is a research intern working with me on SCEC projects. Thanks for the support to him as well as to our first major GPU simulation in preparation. What Efecan is trying to …'</title>
		<link rel="alternate" type="text/html" href="https://strike.scec.org/scecwiki/index.php?title=AWP_ODC_GPU&amp;diff=6121&amp;oldid=prev"/>
		<updated>2012-10-25T15:44:49Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;#039;Efecan is a research intern working with me on SCEC projects. Thanks for the support to him as well as to our first major GPU simulation in preparation. What Efecan is trying to …&amp;#039;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;Efecan is a research intern working with me on SCEC projects. Thanks for the support to him as well as to our first major GPU simulation in preparation. What Efecan is trying to run is a full scale Chino Hills validation ground motion simulation, at size of 3500x2500x1500 problem size, 16m resolution, we used 250 GPUs on Keeneland, the initial test went well. We are seeing 55x speedup comparing to the CPU code for the same setting, we know Keeneland may have some scaling issue when using high number of nodes, plus this simulation has serious I/O - around 0.5TB, requiring frequent data transfer from GPU to CPU through PCIe. However, we are pretty happy with the performance getting so far. One issue with current GPU code is that we don't have validation prints about velocity max/min value, for that the simulation results couldn't be validated at run time. We are working with our visualization expert to analyze the initial results, and also work with seismologist Kim Olsen. Once everything is set, we'll make reservation next week or so for a full size run which possibly take 5 hours to complete.&lt;/div&gt;</summary>
		<author><name>Maechlin</name></author>
		
	</entry>
</feed>